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Abstract 

This article presents a measurement of the top quark mass using the CDF II detector at Fermilab. 
ColUding beams of protons and anti-protons at Fermilab's Tevatron = 1.96 TeV) produce 
top/anti-top pairs, which decay to W^W~bb; events are selected where one W decays to hadrons, 
and the other W decays to either e or /x plus a neutrino. The data sample corresponds to an 
integrated luminosity of approximately 318 pb~^. A total of 165 tt events are separated into 
four subsamples based on jet transverse energy thresholds and the number of b jets identified by 
reconstructing a displaced vertex. In each event, the reconstructed top quark invariant mass is 
determined by minimizing a ^oi the ovcrconstrained kinematic system. At the same time, the 
mass of the hadronically decaying W boson is measured in the same event sample. The observed 
W boson mass provides an in situ improvement in the determination of the hadronic jet energy 
scale, JES. A simultaneous likelihood fit of the reconstructed top quark masses and the W boson 
invariant masses in the data sample to distributions from simulated signal and background events 
gives a top quark mass of 173.5 (stat. + JES) ± 1.3 (other syst.) GeV/c^, or 173.5 GeV/cP. 

PACS numbers: 13.85Ni, 13.85Qk, 14.65Ha 
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I. INTRODUCTION 



The top quark is the heaviest observed elementary particle, with a mass roughly 40 times 
larger than the mass of the b quark. This property of the top quark produces large contribu- 
tions to electroweak radiative corrections, making more accurate measurements of the top 
quark mass important for precision tests of the standard model and providing tighter con- 
straints on the mass of the putative Higgs particle. The near-unity Yukawa coupling of the 
top quark also hints at a role for the particle in electroweak symmetry breaking. Improved 
measurements of the top quark mass are key not only for completing our current description 
of particle physics, but also for understanding possible physics beyond the standard model. 

The top quark was first observed in 1995 during the first run of the Fermilab Tevatron, 
by CDF [l| and D0 0]. By the end of Run I, the combined measurement of the top quark 
mass was 178.0 ± 4.3 GeV/c^ [3| using 100-125 pb"^ of data per experiment. This article 
reports a measurement of the top quark mass in the lepton + jets decay channel using the 
upgraded CDF II detector at Fermilab, with 318 pb~^ of pp data collected between February 
2002 and August 2004. A brief overview of the analysis is as follows. 

We scrutinize the data for events where a tt pair has been produced and has decayed 
to two W bosons and two b quarks, where subsequently one W boson decayed to two 
quarks, and the other W boson decayed to an electron or muon and a neutrino. Thus we 
look for a high-energy electron or muon, momentum imbalance in the detector representing 
the neutrino, two jets of particles corresponding to the b quarks, and two additional jets 
corresponding to the hadronic W decay. 

Our measurement uses an observable that is strongly correlated with the top quark pole 
mass, namely the reconstructed top quark mass. This quantity is determined for each event 
by minimizing a function in a kinematic fit to a ti final state 4]. In this fit, we apply 
energy and momentum conservation, constrain both sets of W decay daughters to have the 
invariant mass of the W boson, and constrain both Wb states to have the same mass. The 
mass reconstruction is complicated by an ambiguity as to which jet represents each quark 
in the final state. However, since the above procedure yields an overconstrained system, we 
can choose which jet to assign to each quark based on the fit quality. In addition, some jets 
are experimentally identified as arising from b quarks by utilizing the relatively long lifetime 
of the b quark, reducing the number of allowed jet-quark assignments. 
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The method we use to measure the top c[^ark mass is similar in concept to an analysis 
performed at CDF using data from Run I 5]. We compare the distribution of the recon- 
structed mass from events in the data with the distributions derived from events simulated 
at various values of the top quark mass. We also simulate events from the expected back- 
ground processes. Our measured value is the top quark mass for which the simulated events, 
when combined with the background, best describe the distribution in the data. We improve 
the power of the method by separating the events into four subsamples that have different 
background contamination and different sensitivity to the top quark mass. 

An important uncertainty in top mass measurements arises from the uncertainty in the 
jet energy scale, particularly for the two jets from b quarks that are direct decay products 
of the top quarks. To reduce this uncertainty, we have developed a technique exploiting the 
fact that the daughters of the hadronically decaying W boson should form an invariant mass 
consistent with the precisely known W boson mass. We constrain the jet energy scale by 
comparing the distribution of observed dijet invariant mass for candidate W boson daughter 
jets with simulated distributions assuming various shifts in the jet energy scale with respect 
to our nominal scale. We show that this improves the jet energy scale information and is 
largely independent of the top quark mass. Furthermore, since this information applies in 
large part to b jets as well, it can be used to significantly reduce the uncertainties in the 
overall top quark mass measurement. A measurement of the top quark mass without this 
additional information gives consistent results, albeit with larger overall uncertainties. 

A brief outline of this article is as follows: In Section m we describe the CDF II detector 
used for the analysis and our event selection for tt candidates in the lepton + jets channel, 
and give background estimates. Section IIIII explains the corrections we make to the jets 
measured in our detector, as well as the systematics associated with these corrections that 
dominate top quark mass measurements. Also described in this section is how we reduce 
these systematics using the W dijet mass. The machinery for reconstructing distributions 
of top quark masses and dijet masses is explained in Section HVj and our method for fitting 
these distributions is described in Sectional Section EH gives the results of fits to the data, 
as well as cross-checks for our measurement. The remaining systematics are detailed in 
Section IVIIl and we conclude in Section IVIIII 
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II. DETECTOR, BACKGROUNDS, AND EVENT SELECTION 



This section begins with an explanation of the tt event signature along with a summary 
of the background processes that can mimic it. The relevant parts of the CDF II detector 
are briefly described, as well as the Monte Carlo generation and simulation procedures. The 
event selection and the separation into disjoint subsamples are defined next. Finally, the 
expected number of background events is discussed. 

A. Event Signature 

In the standard model, the top quark decays with a very short hfetime (r 4 x 10~^^ s) 
and with ~100% branching ratio into a W boson and a b quark. The tt event signature is 
therefore determined by the decay products of the two W bosons, each of which can produce 
two quarks or a charged lepton and a neutrino. This analysis considers events in the lepton 
+ jets channel, where one W decays to quarks and the other W decays to eu^ or /xz/^. In 
the following, "lepton" will refer exclusively to a candidate electron or muon. Thus, events 
of interest to this measurement have an energetic e or a neutrino, and four jets, two of 
which are b jets. More jets may be present due to hard gluon radiation from an incoming 
parton (initial state radiation, ISR) or from a final-state quark (final state radiation, FSR). 
Events where a W boson decays to rz/^ can also enter the event sample when a secondary 
electron or muon from the tau decay passes the lepton cuts — about 6% of identified tt events 
have this decay chain. 

There are several non-tt processes that have similar signatures and enter into the event 
sample for this analysis. Events where a leptonically-decaying W boson is found in associ- 
ation with QCD production of at least four additional jets, sometimes including a bb pair, 
have the same signature and are an irreducible background. Singly-produced top quarks, 
e.g. qq — >■ tb, with a Icptonic W decay and additional jets produced via QCD radiation, also 
have the same signature. Additional background events enter the sample when the tt signa- 
ture is faked. For example, a jet can fake an isolated lepton, albeit with small probability, 
a neutrino can be mistakenly inferred when the missing energy in the event is mismeasured, 
and a leptonically decaying Z boson can look like a 1^ if one lepton goes undetected. 
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FIG. 1: An elevation view of the CDF Run II detector. From the collision region outwards, CDF 
consists of a silicon strip detector, a tracking drift chamber, an electromagnetic calorimeter, a 
hadronic calorimeter, and muon chambers. 

B. Detector 

The Collider Detector at Fermilab is a general-purpose detector observing pp collisions at 
Fermilab's Tevatron. The detector geometry is cylindrical, with the z axis pointing along a 
tangent to the Tevatron ring, in the direction of proton flight in the accelerator. Transverse 
quantities such as Ej- and px are magnitudes of projections into the plane perpendicular to 
the z axis. The coordinates x, y, r, and (p are defined in this transverse plane, with the x 
axis pointing outward from the accelerator ring, and the y axis pointing straight up. The 
angle 9 is the polar angle measured from the proton direction, and f] = — In (tan |) is the 
pseudorapidity. When r] is calculated using the reconstructed interaction point, it is referred 
to as 7]evf Figure H shows an elevation view of the CDF detector. The relevant subdetectors 
are described briefly below. A more complete description of the CDF Run II detector is 
provided elsewhere |6|. 

The CDF tracking system is the first detector element crossed by a particle leaving the 
interaction point in the central region. The silicon detectors Q provide three-dimensional 
position measurements with very good resolution for charged particles close to the interaction 




11 



region, allowing extrapolation of tracks back to the collision point and reconstruction of 
secondary, displaced vertices. There are a total of 722,432 channels, with a typical strip pitch 
of 55-65 fim for axial strips, 60-75 fim for 1.2° small-angle stereo strips, and 125-145 /im 
for 90° stereo strips. The silicon detector is divided into three separate subdetectors. The 
layer 00 (LOO) is a single-sided layer of silicon mounted directly on the beampipe (made of 
beryllium), at a radius of 1.4-1.6 cm, providing an axial measurement close to the collision 
point. The SVXII detector is 90 cm long and contains 12 wedges in 0, each with 5 layers 
of silicon at radii from 2.5 cm to 10.6 cm. One side of each layer contains strips oriented 
in the axial direction, and the other side contains 90° stereo strips in three cases, and 1.2° 
small-angle stereo strips in two cases. The Intermediate Silicon Layers (ISL) comprise three 
additional layers of double-sided silicon at larger radii: at 22 cm for |?7| < 1, and at 20 cm 
and 28 cm for 1 < |?7| < 2. Each layer of the ISL provides axial and small-angle stereo 
measurements. 

The Central Outer Tracker (COT) measures particle locations over a large radial 
distance, providing precise measurements of track curvature up to about |?7| = 1. It is a 
large open-cell drift chamber with 8 "superlayers" (4 axial and 4 with a 2° stereo angle), 
each of which contains 12 wire layers, for a total of 96 layers. There are 30,240 wires in 
total. The COT active volume is 310 cm in length and covers 43 cm to 132 cm in radius. 
An axial magnetic field of 1.4 T is provided by a superconducting solenoid surrounding the 
silicon detectors and central drift chamber. 

Particle energies are measured using sampling calorimeters. The calorimeters are seg- 
mented into towers with projective geometry. The segmentation of the CDF calorimeters is 
rather coarse, so that often several particles contribute to the energy measured in one tower. 

In the central region, i.e. |?7| < 1.1, the calorimeter is divided into wedges subtending 15° 
in 0. Each wedge has ten towers, of roughly equal size in rj, on each side of ?7 = 0. The central 
electromagnetic calorimeter (CEM) 9| contains alternating layers of lead and scintillator, 
making 18 radiation lengths of material. The transverse energy resolution for high-energy 
electrons and photons is ^^2:^ = '^-^■5% 02%. Embedded in the CEM is a shower 
maximum detector, the CES, which provides good position measurements of electromagnetic 
showers at a depth of six radiation lengths and is used in electron identification. The 
CES consists of wire proportional chambers with wires and cathode strips providing stereo 
position information. The central hadronic calorimeter (CHA) and the end wall hadronic 
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calorimeter (WHA) are of similar construction, with alternating layers of steel and 
scintillator (4.7 interaction lengths). The WHA fills a gap in the projective geometry between 
the CHA and the plug calorimeter. 

The calorimetry ill] in the end plugs (1 < |?7| < 3.6) has a very complicated tower 
geometry, but the 15° wedge pattern is respected. The plug electromagnetic calorimeter 
(PEM) has lead absorber and scintillating tile read out with wavelength shifting fibers. An 
electron traversing the PEM passes through 23.2 radiation lengths of material. The energy 
resolution for high-energy electrons and photons is = —^=^=©0.7%. There is a shower 
maximum detector (PES), whose scintillating strips measure the position of electron and 
photon showers. The plug hadronic calorimeter (PHA) has alternating layers of iron and 
scintillating tile, for a total of 6.8 interaction lengths. 

Muon identification is performed by banks of single-wire drift cells four layers deep. The 
central muon detector (CMU) [12] is located directly behind the hadronic calorimeter in a 
limited portion of the central region {\ri\ < 0.6). The central muon upgrade (CMP) adds 
additional coverage in the central region and reduces background with an additional 60 cm of 
steel shielding, corresponding to 2.4 interaction lengths at 90°. The central muon extension 
(CMX) covers the region 0.6 < \ri\ < 1.0, and contains eight layers of drift tubes, with the 
average muon passing through six. 

A three-level trigger system is used to select interesting events to be recorded to tape at 
~ 75 Hz from the bunch crossing rate of 1.7 MHz. This analysis uses data from triggers 
based on high-p^ leptons, which come from the leptonically decaying W in the event. The 
first two trigger levels perform limited reconstruction using dedicated hardware, including 
the eXtremely Fast Tracker (XFT), which reconstructs tracks from the COT in the r-0 plane 
with a momentum resolution of better than 2%pj'[GeV/c] 13]. The electron trigger requires 
a coincidence of an XFT track with an electromagnetic cluster in the central calorimeter, 
while the muon trigger requires that an XFT track point toward a set of hits in the muon 
chambers. The third level is a software trigger that performs full event reconstruction. 
Electron and muon triggers at the third level require fully reconstructed objects as in the 
event selection described below, but with looser criteria. 
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C. Monte Carlo Simulation 



This analysis relies on the use of Monte Carlo (MC) event generation and detector sim- 
ulation. Event generation is performed by HERWIG v6.505 14] for ti signal samples, and 
HERWIG, PYTHIA v6.216 and ALPGEN vl.3 Q for background and control samples. 

A detailed description of the CDF detector is used in a simulation that tracks the inter- 
actions of particles in each subdetector and fills data banks whose format is the same as the 
raw data [17 1. The geant package |3| provides a good description of most interactions, 
and detailed models are developed and tuned to describe other aspects (for example, the 
COT ionization and drift properties) so that high-level quantities like tracking efficiency 
and momentum resolution from the data can be reproduced. The calorimeter simulation is 
performed using a parameterized shower simulation (gflash tuned to single particle 
energy response and shower shapes from the data. 



D. Event Selection 

A data sample enriched in tt events in the lepton + jets channel is selected by looking 
for events with an electron (muon) with Et > 20 GeV {pr > 20 GeV/c), missing transverse 
energy > 20 GeV, at least three jets with Et > 15 GeV, and a fourth jet with Et > 
8 GeV. This section describes the event selection in detail. 

Selected events must contain exactly one well identified lepton candidate in events 
recorded by the high-pr lepton triggers. The lepton candidate can be a central electron 
(GEM), or a muon observed in the CMU and CMP detectors (CMUP) or a muon observed 
in the CMX detector (CMX). The trigger efficiencies for leptons in the final sample are high, 
~ 96% for electrons and ~ 90% for muons, and show negligible pt dependence. 

Electrons are identified by a high-momentum track in the tracking detectors matched 
with an energy cluster in the electromagnetic calorimeter with Et > 20 GeV. The rate of 
photons and hadronic matter faking electrons is reduced by requiring the ratio of calorimeter 
energy to track momentum to be no greater than 2 (unless pt > 50 GeV/c, in which case 
this requirement is not imposed), and by requiring the ratio of hadronic to electromagnetic 
energy in the calorimeter towers to be less than 0. 055 + 0. 00045-i?£;M- Isolated electrons from 
W decays are preferentially selected over electrons from 6 or c quark semi-leptonic decays by 
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requiring the additional calorimeter energy in a cone of AR = a/A^M-At/I^ = 0.4 around 
the cluster to be less than 10% of the cluster energy. Electrons are rejected if they come 
from photon conversions to e~^e~ pairs that have been explicitly reconstructed. 

Muons are identified by a high-momentum track in the tracking detectors {pt > 
20 GeV/c) matched with a set of hits in the muon chambers. The calorimeter towers 
to which the track points must contain energy consistent with a minimum ionizing particle. 
An isolation cut is imposed, requiring the total calorimeter energy in a cone of AR = 0.4 
around the muon track (excluding the towers through which the muon passed) to be less 
than 10% of the track momentum. Cosmic ray muons explicitly identified are rejected. A 
complete description of electron and muon selection, including all additional cuts used, can 
be found elsewhere |20i] . 

A neutrino from the leptonic W boson decay is inferred when the observed momentum 
in the transverse plane does not balance. The missing transverse energy, ^t, is formed 
by projecting each tower energy in the central, wall, and plug calorimeters into the plane 
transverse to the beams and summing: = — lEj-^r^ill' where rii is the unit vector in 
the transverse plane that points to the ith calorimeter tower. The is corrected using the 
muon track momentum when a muon is identified in the event. For clusters of towers that 
have been identified as jets, we apply an additional correction to the due to different 
detector response relative to the fiducial central region and due the effects of multiple pp 
interactions. We require the to be at least 20 GeV. 

Jets are identified by looking for clusters of energy in the calorimeter using a cone algo- 
rithm, JETCLU, where the cone radius is AR = 0.4. Towers with Et > 1 GeV are used as a 
seed for the jet search, then nearby towers are added to the clusters, out to the maximum 
radius of 0.4. A final step of splitting and merging is performed such that a tower does 
not contribute to more than one jet. More details about the jet clustering are available 
elsewhere j^^. Jet energies are corrected for relative detector response and for multiple 
interactions, as described in Section Fill Al 

Jets can be identified as b jets using a displaced vertex tagging algorithm, which proceeds 
as follows. The primary event vertex is identified using a fit to all prompt tracks in the 
event and a beamline constraint. The beamline is defined as a linear fit to the collection 
of primary vertices for particular running periods. The luminous region described by the 
beamline has a width of approximately 30 fim in the transverse view and 29 cm in the z 
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FIG. 2: The efficiency of the secondary vertex 6-tagging algorithm is shown as a function of jet 
Et for b jets in the central region of the detector < 1), where the tracking efficiency is high. 
The shaded band gives the ±1 a range for 6-tagging efficiency. The curve is measured using a 
combination of data and Monte Carlo simulated samples. 

direction. Jets with > 15 GeV are checked for good-quality tracks with both COT and 
silicon information. When a secondary vertex can be reconstructed from at least two of 
those tracks, the signed distance between the primary and secondary vertices along the jet 
direction in the plane transverse to the beams {L2d) is calculated, along with its uncertainty 
(cr(L2D))- If L2d/o'{L2d) > 7.5, the jet is considered tagged. The per-jet efficiency for 6 jets 
in the central region is shown as a function of jet Et in Fig.|21 the algorithm has an efficiency 
of about 60% for tagging at least one b jet in a tt event. More information concerning b 



tagging is available elsewhere 
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An additional b tagging algorithm is used only in a cross-check of this analysis, described 
in Section fVl CI The Jet Probability (JPB) tagger [2^ 24 1 calculates the probability of 
observing the r-0 impact parameters of the tracks in the jet with respect to the primary 
interaction vertex, under the hypothesis that the jet does not arise from a heavy-flavor 
quark. In the check described later, a jet is identified as a 6 jet if it has a JPB value less 
than 5%. Since it uses much of the same information, the JPB tag efficiency is correlated 
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with the displaced vertex tag efficiency. 

We require at least four jets in the event with \ri\ < 2.0 in order to reconstruct the tt 
system. In events with more than 4 jets, only the 4 jets with highest Et (the leading 4 jets) 
are used in jet-quark assignments. The events are separated into four subsamples based 
on the jet activity. These four categories of events are found to have different background 
content and different shapes in the reconstruction of the top quark mass for signal events. 
By treating the subsamples separately, the statistical power of the method is improved. 
Double-tagged (2-tag) events have two 6-tagged jets in the event. These events have low 
background contamination, as well as excellent mass resolution, since the number of allowed 
jet-quark assignments is small. In this category, we require three jets with Et > 15 GeV 
and the fourth jet with E^ > 8 GeV. Tight single-tagged (l-tag(T)) events have exactly one 
6-tagged jet in the event, and all four jets with Et > 15 GeV. Loose single-tagged (l-tag(L)) 
events also have exactly one b tag, but the fourth jet has 8 GeV < Et < 15 GeV. These two 
categories have good mass resolution, but l-tag(L) events have a higher background content 
than l-tag(T) events. Finally, 0-tag events have no b tags, and thus a high background 
contamination. To increase the signal to background ratio (S:B), a tighter Et cut is required: 
all four jets must have Et > 21 GeV. 

We find 165 ti candidates in 318 pb^^ of data selected for good quality in all relevant 
subdetectors. The jet selection requirements for each of the four event types are summarized 
in Table HI which also lists the expected signal to background ratio and the number of each 
event type found in the data. The expected S:B assumes a standard model top quark with 
a mass of 178 GeV/c^ (the Run I world average) and a corresponding ti theoretical cross 
section of 6.1 pb. Since in the 0-tag category we do not have an independent background 
estimate, no estimate of S:B is given; about 22 tt events are expected. 

E. Background Estimation 

Wherever possible, we obtain an estimate of the background contamination in each sub- 
sample that is nearly independent of the observed number of events in that subsample; 
adding this information as a constraint in the likelihood fit a priori improves the result. 

The amount and composition of the background contamination depends strongly on the 
number of jets with b tags. In the double 6-tagged sample, the background contribution 
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TABLE I: The selection requirements for the four types of events are given. The subsamples have 
different background content and reconstructed mass shapes. The jet Ej- requirements apply to 
the leading four jets in the event, but additional jets are permitted. Also shown are the number 
of events observed in 318 pb^^ of data, and, for purposes of illustration, the expected signal to 
background ratio (S:B) assuming a tt cross section of 6.1 pb. The 0-tag sample category has no 
independent background estimate. 



Category 


2-tag 


l-tag(T) 


l-tag(L) 


0-tag 


Jet Et jl-j3 
cuts (GeV) j4 


Et > 15 
Et>8 


Et > 15 
Et > 15 


Et > 15 
15 > > 8 


Et > 21 
Et > 21 


^-tagged jets 


2 


1 


1 





Expected S:B 


10.6:1 


3.7:1 


1.1:1 


N/A 


Number of events 


25 


63 


33 


44 



is very small. In the single 6-tagged sample, the dominant backgrounds are W + multijet 
events and non-W QCD events where the primary lepton is not from a W decay. The W 
+ multijet events contain either a heavy flavor jet or a light flavor jet mistagged as a heavy 
flavor jet. In the events with no b tag, W + multijet production dominates, and the jets are 
primarily light flavor since there are no b tags. 

Table ini gives estimates for the background composition in each tagged subsample. Note 
that some of the estimates in Table |n] for the various background processes are correlated, 
so the uncertainty on the total background is not simply the sum in quadrature of the 
component uncertainties. The procedures for estimat ing each background type are described 



in the following sections, and are detailed elsewhere 
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1. Non-W (QCD) background 

For the non-W background (QCD multijet events), a data-driven technique estimates the 
contribution to the signal sample. The sideband regions of the lepton isolation (> 0.2) vs 
^T (< 15 GeV) plane (after subtracting the expected ti and W + multijet contributions) 
are used to predict the number of QCD multijet events in the signal region, assuming no 
correlation between the isolation and ^t- 
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TABLE II: The sources and expected numbers of background events in the three subsamples with 
b tags. 



Source 




Expected Background 




2-tag 


l-tag(T) 


l-tag(L) 


Non-VF (QCD) 


0.31 ±0.08 


2.32 ±0.50 


2.04 ± 0.54 


Wbb+Wcc+Wc 


1.12 ±0.43 


3.91 ± 1.23 


6.81 ± 1.85 


W + hght jets 


0.40 ± 0.08 


3.22 ± 0.41 


4.14 ±0.53 


ww/wz 


0.05 ± 0.01 


0.45 ±0.10 


0.71 ± 0.13 


Single top 


0.008 ± 0.002 


0.49 ± 0.09 


0.60 ±0.11 


Total 


1.89 ±0.52 


10.4 ± 1.72 


14.3 ± 2.45 



2. W + multijet backgrounds 



Simulated samples of + multijet backgrounds are obtained using the ALPGEN gener- 
ator, which produces multiple partons associated with a W boson using an exact leading 
order matrix element calculation. The generator is interfaced with HERWIG to simulate par- 
ton showering and hadronization. ALPGEN describes the kinematics of events with high jet 
multiplicity very well, but suffers from a large theoretical uncertainty in the normalization 
due to the choice of scale and next-to-leading order (NLO) effects. Thus, the normaliza- 
tion for these backgrounds is taken from the data. The normalization for the W + multijet 
background in the subsamples requiring b tags comes from the W + multijet events before 
tagging, after subtracting the expected contributions for tt and non-1^ processes. Due to 
this procedure, the tagged background predictions are weakly coupled to the observed num- 
bers of events in the tagged subsamples. Using the same procedure, the 0-tag background 
estimate would be strongly coupled to the number of observed 0-tag events. In order to 
avoid this correlation in the likelihood fit, no background constraint is used for the 0-tag 
sample. 

The major contributions for the W + heavy flavor backgrounds, i.e. events with a b tag 
on a real 6 or c jet, come from the Wbb, Wcc, and Wc processes. The fractions of inclusive 
W + multijet events that contain bb pairs, cc pairs, and single c quarks are estimated using 
the alpgen/herwig Monte Carlo samples after a calibration to the parallel fractions in 



19 



inclusive jet data. Then the contribution of each background type to the data sample is 
determined by multiplying the corresponding fraction, the event tagging efficiency for the 
particular configuration of b and c jets, and the number of 1^ + multijet events in the data 
before b tagging. 

Another W + multijet contribution comes from events where a light fiavor jet is misiden- 
tified as a heavy fiavor jet. Using jet data events, a per-jet mistag rate is determined as a 
function of the number of tracks, Et, f], and of the jet, and the scalar sum of Et for all 
jets with Et > 10 GeV and |?7| < 2.4. The mistag rate is then applied to pretag data events 
in the signal region to obtain the W + light fiavor contribution. 



3. Other backgrounds 

There are other minor contributions to the backgrounds: diboson production (VTiy, 
WZ, and ZZ) associated with iets, and single top production. We use alpgen Monte 
Carlo samples to estimate their acceptance. The NLO cross section values |2^ |2a| are used 
for normalization. 



III. JET CORRECTIONS AND SYSTEMATICS 



Jets of particles arising from quarks and gluons are the most important reconstructed 
objects in the top quark mass measurement, but are measured with poor energy resolution. 
The jet measurements therefore make the largest contribution to the resolution of the mass 
reconstruction described in Section IIVI Additionally, systematic uncertainties on the jet 
energy measurements are the dominant source of systematic uncertainty on the top quark 
mass. We describe here the corrections applied to the measured jet energies, as well as the 
systematic uncertainties on our modeling of the jet production and detector response. A 
more thorough treatment of these topics is available elsewhere Finally, we introduce the 
jet energy scale quantity JES, which is measured in situ using the W boson mass resonance. 
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A. Jet Corrections 



Matching reconstructed jets to quarks from the tt decay has both theoretical and exper- 
imental complications. A correspondence generally can be assumed between measured jet 
quantities and the kinematics of partons from the hard interaction and decay A series of 
corrections are made to jet energies in order to best approximate the corresponding quark 
energies. Measured jet energies have a poor resolution, and arc treated as uncertain quan- 
tities in the mass reconstruction. The measured angles of the jets, in contrast, are good 
approximations of the corresponding quark angles, so they are used without corrections and 
are fixed in the mass reconstruction. 

1. Tower calibrations 

Before chistcring into jets, the calorimeter tower energies arc calibrated as follows. The 
overall electromagnetic scale is set using the peak of the dielectron mass resonance resulting 
from decays of the Z boson. The scale of the hadronic calorimeters is set using test beam 
data, with changes over time monitored using radioactive sources and the energy deposition 
of muons from J/ijj decays, which are minimum ionizing particles (MIPs) in the calorimeter. 
Tower-to-tower uniformity for the CEM is achieved by requiring the ratio of electromagnetic 
energy to track momentum [E/p) of electrons to be the same across the calorimeter. In 
the CHA and WHA, the J/ip fifi MIPs arc also used to equalize the response of towers. 
For the PEM and PHA, where tracks are not available, the tower-to-tower calibrations use 
a laser calibration system and ^°Co sourcing. The WHA calorimeter also has a sourcing 
system to monitor changes in the tower gains. 

2. Process-independent corrections 

After clustering, jets arc first corrected with a set of "generic" jet corrections, so called 
because they are intended to be independent of the particular process under considera- 
tion. For these corrections, the quark px distribution is assumed to be flat. Since some of 
the corrections are a function of jet pr, and since the jet resolution is non-negligible, this 
assumption has a considerable effect on the derived correction. 

These generic jet corrections scale the measured jet four- vector to account for a set of 
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well studied effects. First, a dijet balancing procedure is used to determine and correct for 
variations in the calorimeter response to jets as a function of r]. These variations are due 
to different detector technology, to differing amounts of material in the tracking volume 
and the calorimeters, and to uninstrumented regions. In dijet balancing, events are selected 
with two and only two jets, one in the well understood central region (0.2 < |?7| < 0.6). 
A correction is determined such that the transverse momentum of the other jet, called the 
probe jet, as a function of its rj, is equal on average to that of the central jet. This relative 
correction ranges from about +15% to —10%, and can be seen in Fig. H in Section ITTTbI 

After a small correction for the extra energy deposited by multiple collisions in the same 
accelerator bunch crossing, a correction for calorimeter non-linearity is applied so that the 
jet energies correspond to the most probable in-cone hadronic energy assuming a flat pt 
distribution. First, the response of the calorimeter to hadrons is measured using E/p oi 
single tracks in the data. Studies of energy flow and jet shapes in the data also constrain the 
modeling of jet fragmentation. After tuning the simulation to model what we observe in the 
data, the correction (+10% to +30%, depending on jet pr) is determined using a simulated 
sample of dijet events covering a large pr range. 

3. Process-specific corrections 

Jet corrections are then applied that have been derived specifically for the ti process. 
These corrections account for shifts in the mean jet energy due to the shape of the pt 
distribution of quarks from ti decay, for the extra energy deposited by remnants of the 
pp collision not involved in the hard interaction ("underlying event"), and for the energy 
falling outside the jet clustering cone. Light-quark jets from W boson decay {W jets) 
and b jets, which have different pt distributions, fragmentation, and decay properties, are 
corrected using different functions, but no separate correction is attempted for b jets with 
identified semi-leptonic decays. Each jet energy is also assigned an uncertainty arising from 
the measurement resolution of the calorimeter. Note that, since these corrections depend 
on the flavor of the jet, they must be applied after a hypothesis has been selected for the 
assignment of the measured jets to quarks from the ti decay chain. 

The tt-specific corrections are extracted from a large sample of herwig ti events (Mtop = 
178 GeV/c^) in which the four leading jets in Et are matched within AR = 0.4 to the four 
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generator-level quarks from tt decay. The correction functions are consistent with those 
extracted from a large pythia sample. The correction is defined as the most probable value 
(MPV) of the jet response {p'i^'^'^^ — p'^^)/p'^^ ■, as a function of j)^^ and r]^^^. Since the r]^^^ 
dependence is negligible for the light-quark jets, their correction depends only on p^^*. The 
MPV is chosen, rather than the mean of the asymmetric distribution, in order to accurately 
correct as many jets as possible in the core of the distribution. This increases the number of 
events for which the correct jet-quark assignment is chosen by the fitter (see below), resulting 
in a narrower core for the reconstructed mass distribution. A corresponding resolution is 
found by taking the symmetric window about the MPV of the jet response that includes 
68% of the total area. Figure El shows the corrections and resolutions as a function of jet pt 
for several values of \r]\. 

As a final step in correcting the jet four- vector, the jet momentum is held fixed while the 
jet energy is adjusted so that the jet has a mass according to its flavor hypothesis. A mass 
of 0.5 GeV is used for W jets, and a mass of 5.0 GeV is used for h jets. This is done to 
match the generator-level quarks used to derive the tt-specific corrections. 

B. Systematics from Jet Energy Scale 

There are significant uncertainties on many aspects of the measurement of jet energies. 
Some of these are in the form of uncertainties on the energy measurements themselves; some 
are uncertainties on the detector simulation, which is used to derive many corrections, and 
ultimately to extract the top quark mass; still others are best understood as theoretical 
uncertainties on jet production and fragmentation models used in the generators. 

1. Calorimeter response relative to central 

The systematic uncertainties in the calorimeter response relative to the central calorim- 
eter range from 0.5% to 2.5% for jets used in this analysis. The uncertainties account for 
the residual rj dependence after dijet balancing, biases in the dijet balancing procedure (es- 
pecially near the uninstrumented regions) and the variation of the plug calorimeter response 
with time. Photon-jet balancing is used to check the rj dependence after corrections in data 
and simulated events, and the residual differences in this comparison are also included in 
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FIG. 3: The tt-specific corrections are shown for W jets (left) and b jets (right) as a function 
of jet pt for several values of \r]\. On the top is the correction factor, and on the bottom is the 
fractional resolution passed to the fitter. The histograms give the distributions of jet px (arbitrarily 
normalized) from a signal Monte Carlo sample with generated top quark mass of 178 GeV/c^. 

the systematic uncertainty. Figure |3 shows the dijet balancing as a function of the probe 
jet pseudorapidity, demonstrating that the simulation models well the detector response for 
\ri\ < 2.0. Since differing response in neighboring regions of the detector is the primary 
source of biased jet angle measurements, the plot also demonstrates that we can expect 
angle biases to be well modeled in the simulated events. 
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FIG. 4: Results of the dijet balancing procedure are shown for data and simulated dijet events 
with p}^^ > 20 GeV. Probe jets from throughout the detector are compared with a reference jet in 
the central region; the ratio of the pT of the jets is plotted as a function of the probe jet r]. The 
simulation models well the detector response as a function of rj. 

2. Modeling of hadron jets 

The main systematic uncertainties at the hadronic level are obtained by propagating the 
uncertainties on the single particle response and the fragmentation, which are determined 
from studies on the data. Smaller contributions are included from the comparison of data and 
Monte Carlo simulation of the calorimeter response close to tower boundaries in azimuth, and 
from the stability of the calorimeter calibration with time. There is also a small uncertainty 
on the energy deposited by additional pp interactions. In all, this uncertainty varies from 
1.5% to 3.0%, depending on jet pt, and only accounts for variations that affect the energy 
inside the jet cone. 

3. Modeling of out- of- cone energy 

The uncertainty on the fraction of energy contained in the jet cone (also primarily due 
to jet fragmentation modeling) is estimated in two parts, one between R = 0.4 and R = 1.3 
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and the other for R > 1.3. This systematic, which is roughly 9% at very low jet pt but falls 
rapidly to < 2% for pr > 70 GeV, is determined by comparing the energy flow in jets from 
data and Monte Carlo for various event topologies. 

4- Modeling of underlying event 

The underlying event deposits energy uniformly in calorimeter towers throughout the 
detector, some of which are clustered into jets. Such energy is subtracted from the jet 
energy in the corrections. The uncertainty on this correction decreases rapidly from 2% at 
very low pr to less than 0.5% at about 35 GeV. 

5. Total uncertainty 

The systematic uncertainties on jet energies for jets in the reference central region (0.2 < 
|?7| < 0.6) are shown as a function of px in Fig. For other rj regions, only the contribution 
of the "relative response" uncertainty changes. The black line gives the total uncertainty on 
the jet energy measurement, obtained by adding in quadrature the contributions described 
above. 

Events in which a jet recoils against a high energy photon are used to check the absolute 
corrections. We compare the corrected jet energy to the photon energy, which is well cali- 
brated using Z e^e~ decays. This 7-jet balancing is performed on data and Monte Carlo 
samples, as a function of photon Et and jet 77, as a cross check of the energy corrections 
and systematic uncertainties described above. Figure IHl shows a comparison of the 7-jet 
balancing in data and Monte Carlo after all jet corrections, along with the ±1 a range of the 
jet energy systematics. The agreement provides confidence that the systematic uncertainties 
are reasonable. 

The systematic uncertainties on jet energies described here are understood to apply to all 
jets. Clearly additional flavor-specific or process-specific uncertainties could be present. In 
particular, any systematics specific to the h jets are extremely important in a measurement 
of the top quark mass, and could arise from mismodeling of h quark fragmentation, semi- 
leptonic decays, or color connections not present in the W boson decay system. Uncertainties 
from these sources have been studied and found to be relatively small; see Section IVII Al 
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FIG. 5: The systematic uncertainties on jet energy are shown for jets in the central calorimeter 
(0.2 < \r]\ < 0.6). For non-central jets, the total uncertainty has a different contribution from the 
eta-dependent uncertainty. In this plot the corrected jet transverse momentum p'^^^ is the process- 
independent estimate of the parton px- At low p^", the main contribution to the systematic is 
from the uncertainty on the fraction of jet energy lost outside the cone, while at high p^" it is 
from the linearity corrections to obtain an absolute jet energy scale. 

C. Jet Energy Scale 

Since the jet energy systematics described in the previous section generate the dominant 
systematic uncertainty on the top quark mass measurement, a method has been developed to 
further constrain those systematics using the W boson mass resonance in situ. In particular, 
we measure a parameter JES that represents a shift in the jet energy scale from our default 
calibration. 

Rather than defining JES as a constant percentage shift of the jet energies, we define it 
in units of the total nominal jet energy scale uncertainty {(Jc), which is derived from the 
extrinsic calibration procedures above. This (Xc is the quantity depicted in Fig. El for central 
jets. Thus JES = o"c corresponds to our default jet energy scale; JES = 1 cxc implies 
a shift in all jet energies by one standard deviation in the uncertainty defined above; and 
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FIG. 6: For 7-jet events in both data and simulation, we find the fractional difference in pT between 
the jet and the photon after all jet corrections are applied. Plotted here, for different ranges of jet 
rj, is the difference between this quantity in data and simulated events as a function of photon px- 
The solid lines show the ±1 a range given by the jet energy systematics. The other lines follow 
the same definitions as in Fig. \^ 



so on. This choice has two consequences. The first is that the effect of a shift in JES is 
different for jets with different Pt and 77. For example, jets with very low pr have a larger 
fractional uncertainty, and therefore have a larger fractional shift with a 1 Uc change in 
JES. The second is that it is easy to incorporate the independent estimate of the jet energy 
systematics (with its pt and r] dependence) by constraining JES using a Gaussian centered 
at cTc and with a width of 1 ctc. 

As described in Section 1111 Bl the jet energy scale uncertainty comprises many small 
effects, which have different dependences on jet rj and pt- With more statistics, we would 
choose to measure the various effects independently. Currently, however, we make the 
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approximation of assuming that a single value of the total jet energy scale parameter JES 
applies to all jets in the sample; that is, we measure a value of JES that is averaged over jets 
in the sample. Additionally, by construction, our JES measurement is primarily sensitive to 
jets from the hadronic W decay. We estimate the effect of this approximation as a systematic 
uncertainty on the top quark mass measurement. 

IV. MASS RECONSTRUCTION 

In this section, we describe the procedures for determining in each event the reconstructed 
top quark mass ml'^'^" and the dijet mass rrijj, representing the mass of the hadronically 
decaying W boson. We then discuss the results of applying these reconstruction techniques. 
Remember that by itself mp'^" is not an event-by-event measurement of the top quark mass; 
rather it is a quantity whose distribution in the data will be compared with simulated 
samples to extract the top quark mass (see Sectional). Similarly, the distribution of mjj 
will be used to constrain the calibration of the jet energy scale in the reconstructed events. 

Throughout the mass reconstruction, each event is assumed to be a tt event decaying in 
the lepton + jets channel, and the four leading jets are assumed to correspond to the four 
quarks from the top and W decays. First, the measured four-vectors for the jets and lepton 
in the event are corrected for known effects, and resolutions are assigned where needed. 
Next, for the top quark mass reconstruction, a fit is used to extract the reconstructed 
mass, so that each event has a particular value of ml^^° and a corresponding value. Some 
events are discarded from the event sample when their minimized exceeds a cut value. 
Meanwhile, for the dijet mass reconstruction, the invariant mass mjj is calculated for each 
pair of jets without b tags among the leading four jets. 

A. Inputs to the mass reconstruction 

The fit takes as input the four-vectors of the jets and lepton identified in the event. All 
known corrections are applied to these 4-vectors, and Gaussian uncertainties are computed 
for the transverse momenta, since they will be permitted to vary in the fit. The treatment 
of the neutrino four-vector is more complicated, since the is a derived quantity, and 
does not have an uncertainty independent of the other measured values. The includes 
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instead information about a related fundamental quantity, the unclustered energy, which is 
described below. 



1. Jet inputs 

The corrections made to the jet four- vectors are described in detail in Section Fill Al To 
summarize, a series of corrections are applied to the jet energies in order to determine the 
energy of the quark corresponding to each jet. The jet angles are relatively well measured, 
and are fixed in the kinematic fit. The final step of the jet corrections is the tt-specific 
correction that treats separately h jets and jets from the W decay, and in addition provides 
for the pt of each jet a resolution that is used in the expression. 

2. Lepton Inputs 

The electron four- vector has energy determined by its electromagnetic calorimeter cluster, 
and angles defined by the associated track. The electron energy is corrected for differences 
in the calorimeter response depending on where in the tower face the electron enters. The 
electron mass is set to zero, and the angles are taken as perfectly measured quantities. The 
transverse momentum {p^ = p sin 6) of the electron is assigned an uncertainty of 



zero. Track curvature corrections due to chamber misalignment are applied. The angles and 
mass are given no uncertainty; the transverse momentum has an uncertainty of 

^ = 0.0011 ■ p^[GeV/c], (IV.2) 
Pt 

The uncertainties on measured electron and muon transverse momenta are obtained from 
studies of leptonic decays. 

3. Neutrino Inputs: Unclustered Energy 

The neutrino in a ti event is not observed; its presence is inferred by an imbalance in the 
observed transverse momentum. Therefore, rather than treating the neutrino four-vector as 




(IV.l) 



The muon four-vector uses the three-vector of the associated track, also with a mass of 
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an independent input to the fit, the measured quantities, as varied in the fit, are used to 
dynamically calculate the neutrino transverse momentum. 

All of the transverse energy in the calorimeter (towers with |?7| < 3.6) that is not as- 
sociated with the primary lepton or one of the leading four jets is considered "unclustered 
energy." For towers clustered into a jet that has Et > 8 GeV and |?7| < 2.0, but that is 
not one of the leading four jets, the tower momenta are replaced with the jet momentum 
after the generic jet corrections described in Section FlII A 21 The rest of the tower momenta 
are multiplied by a scale factor of 1.4, which is the estimated generic correction factor for 
8 GeV jets. Finally, the unclustered energy includes the energy attributed to enter into the 
leading four jets from the underlying event, and excludes the energy thought to fall outside 
the jet cones of the leading four jets. This avoids double-counting of energy that is included 
in the leading four jet energies after all corrections. Each transverse component of the un- 
clustered energy {p^^, Py^) is assigned an uncertainty of 0.4a/^~^^, where ^ii^™*^^ is 
the scalar sum of the transverse energy excluding the primary lepton and leading four jets. 
The uncertainty comes from studies of events with no real missing energy and no hard jet 
activity. 

The unclustered energy is the observed quantity and the input to the fit? but it is 
related to the missing energy through the other measured physics objects in the event, 
since the pp system has total transverse momentum close to 0. The neutrino transverse 
momentum p^ is calculated at each step of the fit, using the fitted values of lepton, jet, and 
unclustered transverse energies: 



Note that this quantity, used in the mass fitting procedure, is different from the missing 
energy described in Section IllDI and used in event selection, where simpler calorimeter 
energy corrections are used. 

Although other treatments of the unclustered energy and missing energy can be moti- 
vated, the calculation does not have a large effect on the results of the fit- Various 
other approaches to correcting the unclustered energy and assigning resolution were tried, 
and no changes had any significant effect on the reconstructed top quark mass resolution. 

The mass of the neutrino is fixed at zero, and the longitudinal momentum, p^, is a free 
(unconstrained) parameter in the fit. The initial value of is calculated using the initial 
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value of the lepton four- vector and the initial p^, assuming that they arise from a W boson 
at the nominal pole mass. Since these conditions yield a quadratic equation, there are in 
general two solutions for the p'^; a separate fit is done with each solution used as the 
initial value of p'^. When the solutions are imaginary, the real part ± 20 GeV are the two 
values of used to initialize the fit. 

B. Event fit 

Given the inputs described above, the event-by-event fit for the reconstructed top quark 
mass proceeds as follows. MINUIT is used to minimize a where ml^° is a free parameter. 
For each event, the is minimized once for each possible way of assigning the leading four 
jets to the four quarks from the tt decay. Since the two W daughter jets are indistinguishable 
in the x^ expression, the number of permutations is ^ = 12. In addition, there arc two 
solutions for the initial value of the neutrino longitudinal momentum, so the minimization 
is performed a total of 24 times for each event. When b tags are present, permutations that 
assign a tagged jet to a hght quark at parton level are rejected. In the case of single-tagged 
events, the number of allowed permutations is six, and for double-tagged events, it is two. 
In the rare cases when an event has three b tags, two of the tagged jets must be assigned to 
b quarks. Wc use the reconstructed top quark mass from the permutation with the lowest 

after minimization. 

The expression has terms for the uncertainty on the measurements of jet, lepton, and 
unclustered energies, as well as terms for the kinematic constraints applied to the system: 



The first term constrains the pr of the lepton and four leading jets to their measured 
values within their assigned uncertainties; the second term does the same for both transverse 
components of the unclustered energy. In the remaining four terms, the quantities Mj^^, Mjj, 
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Mmu, and Mhjj refer to the invariant mass of the sum of the four- vectors denoted in the 
subscripts. For example, M^j, is the invariant mass of the sum of the lepton and neutrino 
four-vectors. is the pole mass of the W boson, 80.42 GeV/c^ and ml^'^° is the 

free parameter for the reconstructed top quark mass used in the minimization. Mjj is a 
quantity computed in the kinematic fit, and should not be confused with mjj, the measured 
dijet mass used to constrain JES. The fit is initialized with m^''™ = 175 GeV/c^. Tw and 
Tt are the total width of the W boson and the top quark. In order to use the formalism, 
the W and top Breit-Wigner lineshapes are modeled with Gaussian distributions, using the 
Breit-Wigner full width at half maximum as the Gaussian sigma. Tw is 2.12 GeV and 
i 4 IS i.o GeV Q- 

Thus these terms provide constraints such that the W masses come out 
correctly, and the t and t masses come out the same (modulo the Breit-Wigner distribution, 
here modeled by a Gaussian, in both cases). 

The jet-quark assignment (and solution) with the lowest after minimization is 
selected for each event. The of this combination is denoted Xmin (o^ J^st when the 
context is unambiguous), and the requirement Xmin < 9 is imposed. The expected statistical 
uncertainty on the top quark mass does not change much over a wide range of the value of 
the cut, even when it is varied independently for the four event types. The value of the cut 
chosen is close to the minimum of expected top quark mass uncertainty. 



C. Dijet Mass and Jet Energy Scale 

We calculate the dijet masses used to constraint JES in the same data sample used to 
reconstruct the observed top quark mass, with the exception that there is no requirement 
on the jet-quark assignments under consideration. The imposition of the requirement 
would impose a bias in the dijet masses being considered and therefore reduce the sensi- 
tivity of the dijet mass distribution to JES. We calculate the dijet masses directly from 
the measured jet four-vectors without the use of a kinematic fit, considering all jet-quark 
assignments in each event for any of the leading 4 jets that are not 6-tagged. Monte Carlo 
studies have shown that the sensitivity of the dijet mass distribution to the JES parameter 
is maximized by considering all dijet mass combinations that do not involve a 6-tagged jet 
in each event. The number of possible assignments ranges from one (for events with two b 
tags) to six (for events with no b tags). 
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D. Mass reconstruction results 

Typical reconstructed top quark mass distributions for signal Monte Carlo (Mtop = 
178 GeV/c^) are shown for the four event categories as the light histograms in Fig. [7| Each 
event in the sample that passes both event selection and the cut contributes exactly one 
entry to these histograms. The distributions peak near the generated mass of 178 GeV/c^. 
But there is not an exact correspondence between the generated mass and the mean or peak 
position of the reconstructed mass. Differences can arise when ISR/FSR jets are selected 
instead of the ti decay products; even with the correct jets, the fit may choose the wrong 
jet-quark assignment. In particular, the broader shape, beneath the relatively sharp peak 
at 178 GeV/c^, comprises events where an incorrect permutation has been chosen in the 
fit. The dark histograms in the same figure show the reconstructed mass distributions for 
events where the four leading jets correspond to the four quarks from ti decay, and where 
the correct jet-quark assigment is chosen by the fit. These histograms have much smaller 
tails than the overall distributions, and account for 47% of the 2-tag sample, 28% of the 
l-tag(T) sample, 18% of the l-tag(L), and 20% of the 0-tag category. 

The corresponding dijet mass distributions for the W boson reconstruction are shown in 
Fig. IHlfor the four subsamples. Each event contributes 1, 3, or 6 entries to the distributions, 
depending on the number of h tags. One sees a clear W boson mass signal, with a peak 
near the nominal W boson mass of 80 GeV/c^. The peak becomes more evident with 
increasing numbers of 6-tagged jets in the event, a consequence of the decreasing number of 
combinations for W boson jet daughters. 

Some results of the mass reconstruction on Monte Carlo tt signal (Mtop = 178 GeV/c^) 
and background samples are given in Table ITTTl The four subsamples have significantly dif- 
ferent nf^'^" and mjj shapes for ti signal and background, as evidenced by their reconstructed 
mass mean and RMS values. The cut efficiency is lowest for 2-tag events, especially for 
the background processes, because there are fewer allowed jet-quark assignments and thus 
fewer chances to pass the cut. The efficiencies for signal events vary only weakly with the 
generated top quark mass, and for the purposes of this analysis are assumed to be constant. 
The means of the background reconstructed mass distributions are primarily driven by the 
jet cuts (see Table |T)). 

The reconstructed top quark and dijet mass distributions for the 165 events found in the 
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FIG. 7: The light histograms show the reconstructed top quark mass distribution for the 
178 GeV/c^ HERWIG ti sample at the nominal jet energy scale. Overlaid are darker histograms 
of the reconstructed mass distributions using the subset of events for which the leading four jets 
are matched (within AR = 0.4) to the four quarks from the ti decay and the correct jet-quark 
assignment has the lowest x^- Distributions are shown for 2-tag (upper left), l-tag(T) (upper 
right), l-tag(L) (lower left), and 0-tag (lower right) events. 

data can be seen in Fig. IHl These events consist of both ti signal and background events. 
Figure ^1 shows distributions of values from the top quark mass reconstruction in data 
and simulated events, where the distributions from simulation contain the expected mixtures 
of signal and background events. Kolmogorov-Smirnov tests, with probability normalized 
using many trial distributions randomly selected from the Monte Carlo predictions, show 
that the distributions agree well, indicating that kinematic quantities and resolutions are 
correctly simulated. 



V. TOP QUARK MASS FITTING 

The distribution of reconstructed mass (either m^*^™ or mjj) for a particular top quark 
mass (or background process) and jet energy scale is referred to as a template. We compare 
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FIG. 8: The reconstructed dijet mass distributions for the 178 GeV/c^ HERWIG sample at the 
nominal jet energy scale. Overlaid are darker histograms of the reconstructed mass distributions 
using the subset of events for which the leading four jets include two jets matched (within AR = 0.4) 
to the two quarks from the hadronic W decay, and plotting just the invariant mass of those two 
jets. Distributions are shown for 2-tag (upper left), l-tag(T) (upper right), l-tag(L) (lower left), 
and 0-tag (lower right) events. 

the reconstructed top quark mass distribution and the dijet mass distribution from data to 
the Monte Carlo templates to measure simultaneously the top quark mass and the jet energy 
scale. First, probability density functions (p.d.f.'s) for the reconstructed top quark and dijet 
masses are determined for signal events and background events in each subsample by fitting 
a functional form to the corresponding templates; the signal p.d.f.'s depend on the top quark 
mass and jet energy scale. The shift in jet energy scale is given by JES, which is the relative 
shift in units of the nominal uncertainty in the jet energy scale derived from the extrinsic 
calibration procedures (Section 1111 B|) . Although the jet energy scale uncertainty varies with 
jet momentum and pseudorapidity, a one unit shift in the JES parameter is approximately 
equivalent to a 3% shift in the jet energy scale for jets in tt events. We perform an unbinned 
likelihood fit to determine the values of the top quark mass and jet energy scale that best 
describe the data. At the end of this section, we describe a number of checks of the method 
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FIG. 9: The top histogram shows the reconstructed top quark mass mp™, and the bottom his- 
togram shows the dijet reconstructed mass m-jj, with events from the four subsamples represented 
by separate stacked histograms. In the mjj plot, each event has a different number of jet pairs, 
depending on the number of b tags in the event, but the entries are weighted so that the total 
contribution from each event is one unit. The highest mjj bin contains overflow entries. 
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TABLE III: Monte Carlo samples of tt signal and of background processes with the expected 
relative weights are run through the mass fitter. For signal and background in each of the 
four event categories, the table shows the efficiency of the cut and the mean and RMS of the 
resulting reconstructed mass distributions. The signal sample has Mtop = 178 GeV/c^, and the 
nominal jet energy scale is used for all events. 



Sample 


cut 


^reco 


(GeV/c2) 




(GeV/c2) 


Description 


eff. 


Mean 


RMS 


Mean 


RMS 


Signal 












2-tag 


0.65 


173.9 


26.6 


76.8 


34.5 


l-tag(T) 


0.85 


174.0 


31.8 


98.1 


49.1 


l-tag(L) 


0.80 


167.4 


30.8 


77.8 


41.7 


0-tag 


0.91 


179.3 


36.9 


112.6 


57.9 


Background 












2-tag 


0.38 


160.2 


35.1 


77.2 


53.3 


l-tag(T) 


0.73 


166.4 


42.2 


95.8 


59.7 


l-tag(L) 


0.71 


153.7 


37.3 


67.7 


46.3 


O-tas 


0.83 


182.6 


46.5 


114.0 


69.3 



using simulated events. 

A. Parameterization of Signal and Background Shapes 

Since templates are available only at discrete vahics of top quark mass and jet energy 
scale, the signal reconstructed mass distributions in each subsample are parameterized by 
a flexible functional form as a function of top quark mass and jet energy scale in order to 
smooth the distributions and interpolate between the templates. 

For background events, the parameterization has no dependence on top quark mass or jet 
energy scale; a single p.d.f. is used to describe each background reconstructed mass shape 
in each subsample. In principle, a shift in the jet energy scale will change the shape of the 
background templates. However, we have determined from studies of the background that 
the shape of the background templates are insensitive to the jet energy scale. Rather, the 
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FIG. 10: The distribution is shown for data events and for signal and background simulated 
events in the expected ratio. Distributions are shown for 2-tag (upper left), l-tag(T) (upper right), 
l-tag(L) (lower left), and O-tag (lower right) events. 



overall rate of background events does show some sensitivity to the jet energy response, and 
this uncertainty is incorporated into the uncertainty in the rate of background events in the 
sample. 

The same parameterizations are used for both m^''™ and mjj signal p.d.f.'s, although of 
course the fitted parameters are different. In the case of the background, different functional 
forms are required to fit well the m^'^™ and mjj templates. 



1. Signal shape parameterization 

Signal templates are produced using sets of Monte Carlo samples with the input top 
quark mass at 2.5-5 GeV/c^ intervals from 130 GeV/c^ to 230 GeV/c^ and the jet energy 
scale varying from —3.0 to +3.0 in steps of 0.5. Examples of the template shapes from each 
event category are given in Fig. [7| (m^^'^°) and Fig. |S1 (mjj). Table HVI shows the evolution 
of the mean, most probable value, and RMS of the reconstructed top quark mass templates 
as a function of top quark mass using selected generated mass samples and the nominal jet 
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TABLE IV: The evolution of the mp^° template parameters is demonstrated using selected signal 
Monte Carlo samples with generated top quark mass of 145 GeV/c^, 165 GeV/c^, 185 GcV/c^, 
and 205 GeV/c^, with the nominal jet energy scale. The mean, most probable value (MPV), and 
RMS of the template are given for each subsample in each generated mass. The mean and MPV 
of the templates are driven by the jet Et cuts, and the widths are dominated by the fraction of 
events with correct jet-quark assignments. 





Mtop 


2-tag 


l-tag(T) 


l-tag(L) 


0-tag 




145 


151.5 


155.1 


147.1 


163.9 


Mean 


165 


164.5 


166.5 


158.8 


174.3 


(GeV/c2) 


185 


178.2 


179.1 


171.1 


185.3 




205 


193.5 


190.5 


182.7 


194.2 




145 


144.5 


144.0 


140.7 


145.0 


MPV 


165 


163.8 


159.5 


156.5 


159.3 


(GeV/c2) 


185 


179.9 


178.1 


171.7 


179.5 




205 


198.5 


194.7 


185.4 


193.9 




145 


25.1 


31.7 


28.5 


39.2 


RMS 


165 


24.8 


31.8 


28.6 


39.1 


(GeV/c2) 


185 


27.1 


32.3 


32.1 


37.7 




205 


28.6 


33.6 


34.1 


37.7 



energy scale. 

We derive from these distributions parametrized templates that are a smoothly varying 
function of top quark mass and jet energy scale. For any given top quark mass and jet energy 
scale, the probability to observe a particular reconstructed mass is specified by a function 
consisting of two Gaussians — intended to account for the well reconstructed quantities — 
plus a gamma distribution — intended to account for cases where the incorrect jets arc used 
to reconstruct the top quark or W masses. The 9 parameters necessary to specify this 
combination of functions are themselves assumed to depend linearly on the top quark mass 
and jet energy scale, so that the full set of p.d.f.'s is specified by 27 parameters. This 
assumed functional form works well in the limited range of top quark masses and jet energy 
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scales considered; as an example, letting the nine parameters have quadratic dependence on 
Mtop or JES does not improve the fit. Thus the parameterization is as follows: 



P,(m; Mtop,JES) 



"7 ■ FTT"^ r ■ - "o)"' ■ exp {-a2{m - ao)) 

r(l + «i) 



l+ai 



+ as ■ 1= ■ exp 



-(m - as)^ 



\2 



*4 



+ (l-ar-a8)-^7=-exp(^!^^^); (V.l) 



where 



ai=Pi+ Pi+9 ■ (Mtop - 175) + p,+i8 ■ (JES). 

The variable m in GeV/c^ refers to the reconstructed top quark or dijet mass, Mtop in GeV/ 
refers to the top quark mass, and JES refers to the shift in the jet energy scale from that 
determined from our calibrations. These template parametrizations are normalized so that, 
for a given top quark mass Mtop and jet energy scale JES, the integral over all reconstructed 
masses m is unity. 

A binned likelihood fit is used to determine the 27 parameter values both for the ml^'^° 
templates and for the mjj templates. The is calculated between the MC samples and the 
prediction from the fit, after rebinning to ensure that each bin has at least five predicted 
events. The resulting values are given in Table along with the number of degrees 
of freedom. Clearly the corresponding probabilities are small, not surprising considering 
the limited flexibility of the functional form and the large statistics of the templates. But 
the method check and calibration of Section IV (^1 show that the disagreement between the 
templates and the parameterizations is not large enough to have a significant effect on the 
measurement. 

In Fig. ^2 four signal templates at varying generated masses are shown overlaid with 
the fitted parameterization evaluated at each mass. This figure exhibits the changing shape 
of the reconstructed mass templates as a function of top mass. Figure ^1 shows the mjj 
templates with varying jet energy scale, overlaid with the fitted parameterization. One sees 
that the location of the W boson peak is sensitive to the jet energy scale. 
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FIG. 11: Four mp™ signal templates for the l-tag(T) sample are shown, with top quark masses 
ranging from 145 GeV/c^ to 205 GeV/c^ and with JES set to 0. Overlaid are the fitted parame- 
terizations at each generated mass, taken from the full parameterization given in Eg. IV. 11 

2. Background shape parameterization 

Monte Carlo simulations of the various processes listed in Section FlI El are used to model 
the reconstructed top quark mass shape and dijet mass shape for background processes. 
When possible, a single large-statistics sample is used to represent several background pro- 
cesses that have similar template shapes. 

For the tagged backgrounds, the W + heavy flavor processes {Wbb, Wcc, Wc) all have 
similar reconstructed mass shapes, as shown in Fig.^Jfor m\^'^° in the l-tag(T) sample, and 
thus are all modeled with a high-statistics Whh simulated sample. WW and WZ events, a 
neghgible contribution to the total expected background, are also included in this category. 
The shapes for the three subsamples with tagged events are found by reconstructing the 
simulated events exactly as is done for the data and signal Monte Carlo. Similarly, the 
simulated s- and t-channel single top quark events are used to obtain corresponding mass 
templates. 

The mass templates for the ly +jets backgrounds in the tagged subsamples, i.e. "mistags," 
are not obtained using the Monte Carlo h tagging, which is not expected to model well the 
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FIG. 12: Four mjj signal templates for the 2-tag sample are shown, with JES values ranging 
from — 3 cj to 3 cj and with the top quark mass set to 180 GeV/c^. Overlaid are the fitted 
parameterizations at each value of jet energy scale, taken from the full parameterization given in 
Ea.lVll 

rate or kinematic dependences of fake tags. Instead, a mistag matrix, derived from the 
data, is used to give the probability for a jet to be falsely tagged as a function of its Et, 
(f), 1], number of tracks, and the J^Et for all jets in the event. Then for each simulated 
event {W + 4 partons, generated by alpgen and showered by herwig), every possible tag 
configuration on the leading four jets is considered. For every tag configuration, the fit with 
lowest among the jet-quark assignments consistent with the assumed tags is selected, and 
the appropriate mass template is filled with a weight corresponding to the probability of 
observing that set of tags. The result is a weighted template for the mistag backgrounds. 

The backgrounds that are least amenable to Monte Carlo modeling arise from QCD 
background events, i.e. events with no real W to produce the isolated lepton and These 
events are difficult to simulate, but can be studied by selecting events in the data with non- 
isolated leptons, which are enriched in this type of background, but kinematically similar to 
events chosen in the default selection. 

The mass reconstruction described in Section |TV| is expected to produce similar results 
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m[«'=° (GeV/c ) 



FIG. 13: The templates for Wcc and Wc background processes are compared to the high-statistics 
Wbb template for all tagged events. The agreement is good in both cases, so the Wbb template is 
used to represent all W + h.f. processes. 



for QCD background events and W + jets background events. This is because the leptonic 
W system, in which these types of events differ, does not have a strong effect on the mass 
reconstruction, since is poorly measured and since the W mass is constrained in the 
expression. In the kinematic properties of the jets, to which the mass reconstruction is very 
sensitive, these two types of events are similar since in both cases the jets arise from hard 
QCD radiation. 

Indeed, within the limited statistics available, the reconstructed mass distributions of the 
QCD-enriched data events are consistent with those of simulated W + jets events. Given 
these similarities, the W + jets reconstructed mass templates are used also for the expected 
contributions from QCD for both the reconstructed top quark mass and the dijet invariant 
mass. An additional check, treating the subset of QCD events where the primary lepton is a 
jet misidentified as an electron, is performed using a large QCD-dominated dataset with at 
least five jets. One jet with a large fraction of its energy in the electromagnetic calorimeter is 
assigned to "fake" an electron, and the mass reconstruction proceeds under that hypothesis. 
Very good agreement is found between the reconstructed mass distributions of these events 
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and those of simulated W + jets events. A systematic uncertainty on the background mod- 
ehng (see Section IVII C|) is assigned using the differences between the templates obtained 
from these three samples: +jets events, events with non-isolated leptons, and events with 
one jet assigned to "fake" an electron. 

The background for the 0-tag subsample is treated separately from the others. The 
dominant process is + jets, with a smaller (~ 20%) contribution expected from non-iy 
(QCD) events. Since we model the reconstructed mass of QCD events using VT+jets events, 
the entire 0-tag background shape comes from W + 4: parton Monte Carlo events, simulated 
by ALPGEN and showered by herwig. 

We do not allow the normalization of each background contribution to vary independently 
in the final likelihood fit. Instead, for each subsample, the templates from all background 
processes are combined in their expected ratios according to Table m A single function is 
fitted to the combined background for each subsample and is used to describe the background 
shape in the final likelihood fit (Section I VBj) . The overall background normalization for each 
subsample is then permitted to vary, within its constraint where applicable. 

We determine the p.d.f.'s for the background reconstructed top quark mass templates 
using a parameterization similar in spirit to that of the signal, but simpler in form. First, 
there is no dependence on top quark mass or jet energy scale. Second, no narrow Gaussian 
peak is expected, so the full shape is modeled by the integrand of the gamma distribution. 
Specifically, 

i+pi 

■exp(-p2(mr°-Po)). (V.2) 

In the case of the 0-tag background events, a slightly more sophisticated function is used to 
achieve a good fit: 

n(mr) = p^f^^^ ■ (mr-por 

■exp(-p2("^r° -Po)) 



1+P4 

+ (i-P6) ^.^^^„, -(mr-P3r 



■exp(-p5(mr°-P3)). (V.3) 
In the background events, the following parameterizations are used to fit the reconstructed 
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TABLE V: The and number of degrees of freedom are given for the signal parameterization fits 
in each of the four subsamples. 



2-tag 


l-tag(T) l-tag(L) 


0-tag 


13977/13081 


X^/n.d.o.f. for reconstructed top quark mass fEq. IV.l)) 
18736/17052 14209/13444 


18791/16752 


19541/17000 


X^/n.d.o.f. for dijet mass fEa. IV.lj) 
29410/25732 23506/19827 


38315/30510 



dijet mass templates. For the background samples with b tags, 



and for the 0-tag sample, 



a 



l + Ol 



r(l + ai 
■exp(-a2(mjj - ao)) 



• exp 



27r 

ma - «3) 

2al 



a 



■ (mjj - ao) 



"1 



r(l + ai 

■exp(-a2("^jj - tto)) 
1 

+ "8 ■ 



• exp 



Q;4V 27r 

(mjj - a'if 



2al 



+ (1 — ar — as) 



1 



■ exp 



-«5 

2al 



2~ 



(V.4) 



(V.5) 



The final background templates for the reconstructed top quark mass and dijet mass for 
the four subsamples are shown in Fig. HH and Fig. UHl respectively, overlaid with the fitted 
parameterization. 
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FIG. 14: Reconstructed top quark mass distributions of the combined backgrounds in each sub- 
sample. The contributions from different background templates are shown stacked; overlaid are 
the fitted curves (see Eq.lVHand Eq.|V21). 
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FIG. 15: Reconstructed dijet mass distributions of the combined backgrounds in each subsample. 
The contributions from different background templates are shown stacked; overlaid are the fitted 
curves (see Eq. IV. 41 and Eq. IV.5|) . 
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B. Likelihood Fit for Top Quark Mass 



The reconstructed mass distributions from data are simultaneously compared to the tem- 
plates from signal and background sources using an unbinned extended likelihood fit. The 
likelihood includes free parameters for the number of expected signal events and back- 
ground events rib in each subsample, and for the top quark pole mass Mtop and the jet energy 
scale JES. For each subsample, the likelihood is given by: 



sample 



-^shape ^ -^shape ^ ''-nev ^ L,^ 



(V.6) 



where 



[6,n,P,(m-r; Mtop, JES) 



jC, 



shape 



shape 



esUs + ebUb 



nsPsirrijj^k] Mtpp, JES) + nbPb{my^,k) _ 



Us + Ub 



n 

k=l 
N-C, 

n 

k=l 

Us) X Pois(A^b; Ub) x 

Ns+Nt=N 

J2 B{Nf;N,,es)BiN^";Nb,eb) 



bg 



exp 



(rib 



n. 



Q\2 



(V.7) 



The values and represent the efficiency of the cut for signal and background events, 
respectively, and are given in Table 11111 and A^^^ are the number of events observed in 
the data before and after the cut. All other symbols are explained below. 

The most important information on the top quark mass is provided by the products in 

^rcco 

-^shape' term of which gives the probability of observing the fcth data event with 

reconstructed mass m^^^" , given the background reconstructed top quark mass template, 
Pb{ml^j:°), and the signal reconstructed top quark mass template with a top quark mass of 
Mtop and energy scale shift JES, Ps{mf^°; Mtop, JES). 

The second term, C^^^^, is sensitive primarily to the value of JES. It refiects the product 
of probabilities for each of Q ways of assigning the two W daughter jets in each event, where 
Ci = 1, 3, 6 for 2,1,0 tags, respectively. The probabilities are analogous to those in the first 
term, but are defined using the dijet mass template parameterizations. 
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The third term in the hkehhood, £nev, represents the information arising from the number 
of signal and background events in the top quark mass and dijet mass samples, which are 
correlated. Since the number of expected signal and background events in the W jj 
sample are Ug and n^, the expected numbers of signal and background events in the ■ml'^'^° 
sample are given by e^n^ and e^nb, respectively. This term expresses the likelihood associated 

2 

with observing and A^-*^ events in the two samples given the expected number of events 
as defined above. We introduce the variables Ng and A*";,, the (unknown) number of signal 
and background events actually in our sample. We sum over the possible values of these two 
variables that are consistent with the total number of observed events. Thus the first two 
factors express the Poisson probability to observe A^^ signal and A";, background events given 
Poisson means of rig and nj,, respectively. For the final factor we introduce and sum over the 
possible values of A^^ and A^^ , variables for the (unknown) actual number of signal and 
background events remaining after the cut. The summand is the binomial probability to 

2 

observe A^^ signal events and A"^ background events in the ml^^" sample given the assumed 
numbers of observed events A^s and Nh in the mjj sample and the cut efficiencies. 

Finally, in the fourth term, £bg, the background normalization is constrained in the like- 
lihood fit by a Gaussian probability distribution centered at n° and with width an^. The 
background normalizations are constrained for the 2-tag, l-tag(T), and l-tag(L) samples. 
For the 0-tag subsample, no background normalization estimate is available, so no back- 
ground constraint is used. Both Ug and rib are required to be greater than zero. 

As described in Sections IIII Bl and IIII C\ independent detector calibrations and studies 
of other processes allow us to independently determine the jet energy scale JES, and this 
information is used in the reconstruction of the data events and in the determination of 
the signal templates. We include in the likelihood fit the knowledge of this independent jet 
energy calibration through an additional term in the overall likelihood: 



where the simplification arises because, by our definition of JES, the calibrated shift in 
energy scale is JES° = and the uncertainty is ctjes = 1-0. 

The total likelihood is given by the product of the likelihoods for the four subsamples 




(V.8) 
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and the jet energy scale constraint: 

-tag ^ ■^l-tag(T) ^ '^l-tag(L) ^ -^O-tag ^ -^JES- (^-9) 

The top quark pole mass Mtop and the jet energy scale JES are shared between the four 
likelihoods. The likelihood is maximized with respect to all ten parameters [ug and nt for 
four subsamples, JES, and Mtop) using the minuit package. A likelihood curve as a function 
of Mtop is found by maximizing the likelihood with respect to all other parameters for a series 
of fixed Mtop. The statistical uncertainty from the fit procedure is taken from the points M^p 
and M^p where the log-likelihood changes by —1/2 unit from its maximum. The positive 
and negative uncertainties are then scaled to achieve unit pull widths as described in the 
next Section IV CI 

C. Method Check 

The method described above is checked for any possible systematic biases by running 
large numbers of "pseudo-experiments," where we create, using Monte Carlo simulation, 
samples of signal and background events with an assumed value of the top quark mass 
and jet energy scale and with the same statistical properties as our observed sample. We 
then perform likelihood fits to each pseudo-experiment and characterize the accuracy of the 
technique in determining the correct values. 

For each pseudo-experiment, the procedure is first to determine the number of signal and 
background events, then to generate a reconstructed top quark mass and dijet mass for each 
event, and finally to fit the resulting pseudo-data using our standard machinery. The number 
of background events in each subsample is Poisson fluctuated around the central value given 
in Table |n] The central value for the 0-tag background is estimated by subtracting the 
estimated number of 0-tag signal events from the observed number of 0-tag events in the 
data. The number of signal events is Poisson fluctuated around the number observed in 
the data, minus the central value for the background expectation, for each subsample. For 
each event, reconstructed masses ml'^^° and mjj are selected at random from the templates 
corresponding to signal or background processes. Some of the events are eliminated from 
the m^^™ sample, according to the cut efficiencies given in Table ITTTl The resulting list 
of reconstructed masses is fit using exactly the same machinery used on the data, described 
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in the previous Section IV Bl Although this default procedure does not model correlations 
among the ml'^'^° and rrijj values in each event, a separate check showed a complete modeling 
of the correlations to have a negligible effect on the checks described here. 

For each pseudo-experiment, the likelihood fit provides a measured top quark mass 
Mtop and jet energy scale JES, as well as positive and negative errors {6~^ and 6~) for 
each from the AlnL = —1/2 procedure. We check the pull distribution for Mtop, de- 
fined using a symmetrized uncertainty on the top quark mass as the distribution of 
(Mtop — M|.op"*)/0.5(5^^^^ + ^Mtop)' where M"p"* is the generated top quark mass. A pull 
distribution is generated for each of 12 input values for the top quark mass, keeping JES 
fixed to zero, where 2500 pseudo-experiments are generated for each input mass value, and 
each pull distribution is fitted using a Gaussian function. We determine a similar set of 
pull distributions for various values of the JES parameter, keeping the top quark mass 
fixed to 180 GeV/c^, although the results in this case are correlated since they all use the 
same Monte Carlo event sample. The mean and sigma of the fitted functions are shown in 
Fig. El Defining the pull distributions using the reported asymmetric uncertainties does 
not systematically change the results. 

In the pull distributions as a function of top quark mass, the pull means show a small 
offset for this particular slice of the Mtop- JES plane. Since the value of this offset varies 
with location in the Mtop- JES space, instead of directly correcting Mtop we take the average 
offset of 0.3 GeV/c^ as a systematic uncertainty on the measurement. 

In addition the pull widths are slightly larger than one due to the modest statistics of the 
event sample. For the expected number of events with current luminosity, and for templates 
such as the ones described in Section IV A| the resulting likelihood curve is typically non- 
Gaussian, and in fact, typically shallower than Gaussian. The pull distributions become 
more Gaussian (with width one) as pseudo-experiments with more events are performed. 
With ten times the statistics, the pull widths are consistent with unity. 

For the current data sample, the quoted measurement with uncertainties is designed to 
have pull width equal to one by scaling the errors taken from AlnL = —1/2. The scale 
factor is the pull width from the lower left plot of Fig. E[ averaged over the values of top 
quark mass, giving 1.027. 
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FIG. 16: The mean (top) and width (bottom) of puh distributions from sets of 2500 pseudo- 
experiments are shown. On the left, the jet energy scale is fixed at its nominal value, and the 
generated top quark mass is varied from 150 GeV/c^ to 210 GeV/c^. On the right, the top quark 
mass is fixed at 180 GeV/c^, and the input jet energy scale is varied from —3 a to -|-3 a. The 
error bars come mostly from the limited statistics of the Monte Carlo samples from which the 
pseudodata is taken. 

VI. RESULTS ON THE DATA 



We fit the events in the data using the procedure described in Section El After detailing 
the results, we present several additional results performed as cross-checks on the primary 
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FIG. 17: The contours of the hkelihood in the Mtop-JES plane for the independent fit to each 
subsample in the data. At each point in the plane, the likelihood is maximized with respect to the 
other free parameters. A crosshair shows the maximum likelihood point from the combined fit, and 
contours are given at regular intervals in AlnL, the change in log-likelihood from its maximum. 
Upper left: 2-tag events; upper right: l-tag(T) events; lower left: l-tag(L) events; lower right: 
0-tag events. 



measurement. 



A. Subsample Likelihood Curves 

The likelihood fit is first performed on each subsample separately. In these fits, each 
subsample likelihood contains the JES constraint term given in Eq. IV. 81 For a series of top 
quark masses and JES values, the mass and JES parameters are fixed, while the likelihood 
is maximized with respect to the remaining parameters [ug and rib) using minuit. The 
resulting likelihood contours in the Mtop-JES plane are shown in Fig. El 
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FIG. 18: The contours of the hkelihood in the Mtop-JES plane for the combined fit to all four 
subsamples. At each point in the plane, the likelihood is maximized with respect to the other free 
parameters. The crosshair shows the best fit point, and contours are given at at regular intervals 
in AlnL, the change in log-likelihood from its maximum. 

B. Results of Combined Likelihood 

Finally, the likelihood is maximized with respect to all parameters using all four sub- 
samples. The result, after scaling the A(lnL) = —1/2 errors as described in Section FV C\ 
is a top quark mass of 173.5 ^I'l (stat. + JES) GeV/c^. The simultaneous measurement of 
the jet energy scale is —0.10 IIqIq Oc- The correlation between the top quark mass and JES 
fits is -0.676. The combined likelihood as a function of top quark mass and JES is shown 
in Fig. ^1 For each value of the top quark mass and JES, the likelihood is maximized 
with respect to all other parameters. This likelihood is not the simple product of the four 
likelihoods shown in Fig. El because the JES constraint term £jes is included in each of the 
subsample fits, but of course only once in the combined fit. 

The uncertainty on Mtop from the likelihood fit is a combination of the statistical uncer- 
tainty in extracting a measurement of Mtop and the systematic uncertainty due to allowed 
variations of JES. It is possible to get an idea of the size of each contribution. Fixing JES to 
its fitted value of —0.10 Oc and fitting for Mtop alone yields a top quark mass measurement of 
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TABLE VI: The input constraints and fitted values are given for all free parameters in the combined 
likelihood fit. 



Categ 


;ory 




2-tag 


l-tag(T) l-tag(L) 


0-tag 


Mtop 




constr. 
fit 




None 

173.5 tl'l (stat. + JES) GeV/c^ 

(173.5 til (stat.) ±2.5 (JES) GeV/c^) 




JES 




constr. 
fit 

fit 


23.5 ±5.0 


0.0 ± 1.0 ac 

-n 10 +° '^^ CI 

^■^^ -0.80 '^c 

None 

53.9 ±7.9 14.3 ±5.2 


28.3 ±8.3 


< 




constr. 
fit 


1.89 ±0.52 
1.8 ±0.5 


10.4 ±1.72 14.3 ±2.45 
10.1 ±1.7 15.5 ±2.2 


None 

15.7 1?:? 


173.5 


+2.7 

-2.6 


(stat.) GeV/c2, 


corresponding 


; to the "pure statistical" uncertainty. 


Subtracting 



this uncertainty in quadrature from the full uncertainty gives an Mtop uncertainty due to 
the jet energy scale of ±2.5 GeV/c^. 

The input constraints and fit results for the combined fit are given in Table IVTl Figure ITTH 
shows the consistency of the reconstructed top quark mass distribution in each subsample 
with the combined fit results, while Fig. 1201 shows the same for the reconstructed dijet mass. 

A set of pseudo-experiments is generated with a top quark mass of 172.5 GeV/c^ (close 
to the central value from the fit), the nominal jet energy scale, and with the number of 
events in each subsample equal to the number observed in our data (Table H]). In Fig. El 
the positive and negative uncertainties from the likelihood fits are plotted. Arrows indicate 
the uncertainties from the fit to the data. Although smaller than the median uncertainties 
from the pseudo-experiments, the uncertainties on the data are reasonable — 9.2% of the 
pseudo-experiments have smaller uncertainties than those returned by the fit to the data. 
The distributions do not change significantly if a top quark mass value of 165 GeV/c^ 
or 180 GeV/c^ is used. The better-than-expected uncertainties are consistent with the 
sharpness of the reconstructed top quark mass peaks in the 2-tag and l-tag(T) subsamples, 
as shown in Fig. ^1 
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m[^^° (GeV/c^) 

FIG. 19: The reconstructed top quark mass distribution for each subsample is shown overlaid with 
the expected distribution using the top quark mass, jet energy scale, signal normalization, and 
background normalization from the combined fit. 





rrijj (GeV/c^) 

FIG. 20: The reconstructed dijet mass distribution for each subsample is shown overlaid with 
the expected distribution using the top quark mass, jet energy scale, signal normalization, and 
background normalization from the combined fit. 
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FIG. 21: The distributions of positive and negative uncertainties from the hkehhood fit are shown, 
for pseudo-experiments generated with a top quark mass of 172.5 GeV/c^, the nominal jet energy 
scale, and the number of events in each subsample as observed in the data. Arrows indicate 
the positive and negative uncertainties from the likelihood fit to the data; 9.2% of the pseudo- 
experiments have smaller uncertainties. 

C. Alternate Fits 

In addition to the primary result described above, a number of additional fits are per- 
formed as cross-checks and to investigate the effect of certain assumptions on our measure- 
ment. The differences between the primary fit and the alternate fits are briefly described 
below, along with the resulting top quark mass measurements. Table I VI II summarizes the 
results. The results from the alternate methods are quite similar, though the methods are 
highly correlated. 

1. Fit without JES constraint 

In the primary fit, the JES measurement is treated as an update to the extrinsic cal- 
ibration by including the JES Gaussian constraint £jes in the likelihood. Here the £jes 
term is removed, so that all the jet energy scale information comes from the in situ cali- 
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bration to the resonance of the hadronically decaying W boson. The resulting top quark 
mass measurement is 174.0 ± 4.5 (stat. + JES) GeV/c^, and the simultaneous fit for JES 
gives —0.25 ± 1.22 (stat.) Cc- Although the systematic uncertainties are not explicitly eval- 
uated for this approach, they are not expected to be significantly different from those of the 
primary analysis. 



2. Traditional Mtop-only fit 

For this alternate result, the traditional fit for a single variable, Mtop, is performed using a 
single reconstructed quantity, mf'^°. This fit is virtually identical to the analysis performed in 
Run I y|. The event selection and m^*^™ reconstruction are exactly as described earlier. With 
only one reconstructed quantity and one measured quantity, the template parameterizations 
are simpler. Signal and background p.d.f.'s for m^*^™ are fitted without any JES dependence, 
but otherwise identical in form to those described above. The form of the likelihood used is 
also much simpler, since the term -C^ape absent and C^ev is greatly simplified with only 
the sample of events after the cut used. For each subsample, the likelihood is given by 

C = Pois(A^; Us + rib) x 



rib 

where in this context n^, n^, and refer to the number of events expected and observed in 
the sample after the cut. The combined likelihood is simply the product of the subsample 
likelihoods. 

The fitted value of the top quark mass using this method is 173.2 l^g (stat.) GeV/c^, 
with a central value very close to the result from the primary measurement. For this result, 
of course, since the jet energy scale systematic uncertainty is not accounted for in the 
likelihood fit, its effect on the top quark mass uncertainty must be estimated separately 
and added in quadrature. The negative log-likelihood curves for the Mtop-only fit in each 
subsample are shown in Fig.|221 As can be seen in the lower-right panel, the 0-tag subsample 
contributes very little to the overall measurement. This is because the fit prefers a small 
signal contribution in this sample with no background constraint, which results in very little 
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FIG. 22: The negative log-likelihood curves as a function of the top quark mass are shown for the 
Mtop-only fit to each subsample in the data. Upper left: 2-tag events; upper right: l-tag(T) events; 
lower left: l-tag(L) events; lower right: 0-tag events. 

sensitivity to Mtop- 



3. Traditional Mtop-only fit with additional tag category 

Events with two b tags carry the most information about the top quark mass because of 
their high purity and narrow reconstructed mass templates. In this alternate analysis, we 
increase the number of events with two b tags by allowing one of the tags to come from the 
Jet Probability tagger. We establish a new category of events with exactly one secondary 
vertex tag and an additional JPB tag; the former requirement ensures that these events do 
not overlap with the 2-tag subsample. The events in this category are then removed from the 
l-tag(T) and l-tag(L) samples so that all the subsamples remain disjoint. Eighteen events 
are found in this category in the data sample: 4 out of 18 events were newly categorized 
from l-tag(L) and 14 events from l-tag(T) in the default configuration. 
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FIG. 23: The reconstructed top quark mass distribution for the 18 events with one secondary 
vertex tag and one JPB tag, overlaid with the expected distribution from the fit to this subsample. 
The inset shows the shape of —A log L for the fit to these events as a function of the top quark 
mass. 

The expected backgrounds in the new event category are estimated to be 0.52±0.26 events 
from Whb^ Wc, and Wcc processes, 0.15 ± 0.08 events from non-W background, 0.38 ± 0.19 
events from mistagged W + jets events, 0.08 ± 0.04 events from single top, and 0.05 ± 0.03 
events from the diboson processes WW and WZ. The total number of background events 
is thus estimated to be 1.2 ± 0.6 for the new subsample. The background estimates for the 
exclusive one-tag subsamples change to account for the reduced acceptances. 

The likelihood used to extract the top quark mass from this data is that described above 
in Section FVl C 21 i.e. using an Mtop-only fit. Figure ESI shows the reconstructed top quark 
mass distribution for the events with one secondary vertex tag and one JPB tag, along with 
the expected distribution using parameters taken from the fit to only this set of events. The 
inset shows the negative log-likelihood curve for this subsample alone. Using only the 18 
events in this subsample, the measured top quark mass is 173.3 til (stat.) GeV/c\ 

This result using this data sample can be combined with the other four categories of 
events. A sensitivity study shows that the combined likelihood including the new class of 
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TABLE VII: The results of alternate fits are summarized. For the cases that do not include the jet 
energy scale systematic effect in the likelihood fit result, the independently determined systematic 



is given for comparison (see Section fVII Dl for more details). 



Method 


Mtop fit result 


JES fit result 




[GeV/c2] 


K] 


Default 


173.5 tl'l (stat. + JES) 


-0 10 +°'^^ 
'^•J^U -0.80 


No JES constr. 


174.0 ± 4.5 (stat. + JES) 


-0.25 ± 1.22 


Mtop-only 


173.2 tal (stat.) ©3.1 (JES) 


N/A 


+ JPB 


173.0 tli (stat.) ©3.0 (JES) 


N/A 



events improves the expected statistical uncertainty by 2.6%. In addition to the statisti- 
cal improvement, increasing the number of double-tagged events improves the jet energy 
systematic uncertainty. The resulting combined top quark mass measurement on the five 
subsamples is 173.0 (stat.) GeV/c^. 

D. Cross-checks on the results 

The measurement of the top quark mass is checked by performing the analysis in various 
subsamples and with different jet corrections and background normalization constraints in 
order to ensure the robustness of the result. In these cross-checks, we use the traditional 
Mtop-only fit described in Section IVI C 21 for simplicity. 

Figure |21] shows the resulting top quark mass measurement for various modifications to 
the method. Any inconsistencies would most likely indicate problems with the detector 
or the analysis method. First the measurement using top-specific corrections (the default) 
and using generic out-of-cone jet corrections is compared. Next fits using the background 
constraints (the default) and without the constraints are shown. For the remaining compar- 
isons, the dataset is divided into two subsamples. First results are shown from two different 
run periods, then events with a primary electron vs those with a primary muon, and finally 
positive-charge primary leptons vs negative- charge primary leptons. Except in the case of 
the generic jet corrections, the default reconstructed mass templates are used. All the results 
are consistent with each other and with the primary measurement. 
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FIG. 24: Top quark mass measurements using the Mtop-only fit are compared for different as- 
sumptions (top-specific corrections (default) vs generic jet out-of-cone corrections, constrained 
backgrounds vs unconstrained) and different ways of subdividing the sample (two different run 
periods, electrons vs muons, positive-charge leptons vs negative-charge leptons). All results with 
only statistical errors are consistent. 

A series of top quark mass measurements using different subsamples of the data is shown 
in Fig. 123 The primary effects of the increasingly tight selection are, first, to increase the 
sample purity and thereby decrease sensitivity to modeling of the background processes; and 
second, to select events with low extra jet activity, decreasing sensitivity to modeling of ISR 
and FSR. The list of samples used is as follows from top to bottom: all four subsamples 
(default, 138 events after cut); 2-tag, l-tag(T), and l-tag(L) only (98 events); 2-tag and 
l-tag(T) only (73 events); 2-tag and l-tag(T) only with any additional jets required to have 
Et < 15 GeV (56 events); 2-tag and l-tag(T) only with any additional jets required to have 
Et < 8 GeV (38 events). For the last two cases, top mass templates are prepared with 
the additional requirements. Again we find that all results are consistent, indicating that 
background kinematics and extra jet activity are reasonably well modeled. 
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FIG. 25: Top quark mass measurements using the Mtop-only fit are compared for various samples. 
From top to bottom: all four subsamples; 2-tag, l-tag(T), and l-tag(L) subsamples; 2-tag and 1- 
tag(T) subsamples only; 2-tag and l-tag(T) subsamples only, additional jets Et < 15 GeV; 2-tag 
and l-tag(T) subsamples only, additional jets Et < 8 GeV. All results with only statistical errors 
are found to be consistent. 

E. Kinematic distributions 

We compare various kinematic distributions for the tt signal candidate events with the 
Monte Carlo predictions for combined signal and backgrounds. Comparisons of kinematic 
distributions tell us how well the Monte Carlo models the data, which is very important 
in this kinematic analysis. This information could additionally be used to test whether 
the kinematic properties of the top quark we observe are consistent with standard model 
predictions. For these distributions, we use only 2-tag and l-tag(T) events with < 9 (73 
events), in order to increase the signal purity. All kinematic quantities are defined using the 
output of the fitter, so that both jet-quark assignments and the pt of each object are 
taken at the minimum point. 

Figure EHl and Fig. |2ZI show the pt and rapidity distributions of the reconstructed top 
quarks, respectively. The data distributions are in agreement with predictions using herwig 



63 



> 

(5 
m 

CO 



c 

> 
IIJ 



50 



40 



30 



20 



10 



"1 



HERWIG ttbar (172.5 GeV/c ) 

W + mistags, non-W 

W + heavy flavor 

Single top 

Data 



± 



50 100 150 200 250 
Reconstructed Top p,^ (GeV/c) 



300 



350 



FIG. 26: The pT distribution of the reconstructed top quarks for 73 signal candidate events (2- 
tag and l-tag(T) subsamples), compared to the prediction from herwig tt signal events (with 
generated top quark mass of 172.5 GeV/c^) and simulated background events. 

tt signal events with top quark mass of 172.5 GeV/c^ and simulated background events. 

We also find good agreement in the pt distribution of the b jets from top decays, shown 
in Fig. I2H1 Good modeling of the 6-jet spectrum by the Monte Carlo simulation is one of 
the most important things for a good determination of the top quark mass. Figure QUI shows 
the Pt distribution of the reconstructed W bosons. 

The Pt distribution of the ti system is shown in Fig. 1201 which has good agreement 
between the data and the prediction from simulated events. This distribution is sensitive 
to the modeling of initial state radiation. The distribution of the number of jets from data 
events is also compared with the prediction from the Monte Carlo simulation, as shown in 
Fig. inU To be counted in this plot, each jet is required to have Et > 8 GeV and |?7| < 2.0; 
note that this distribution is sculpted by the selection requirements for the 2-tag and 1- 
tag(T) subsamples. The data and prediction are in good agreement, indicating that the 
number of extra jets (from hard initial and final state radiation) is reasonably well modeled 
by HERWIG. 
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FIG. 27: The rapidity distribution of the reconstructed top quarks for 73 signal candidate events 
(2-tag and l-tag(T) subsamples), compared to the prediction from herwig signal events (with 
generated top quark mass of 172.5 GeV/c^) and simulated background events. 
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FIG. 28: The pT distribution of the reconstructed b jets for 73 signal candidate events (2-tag and 
l-tag(T) subsamples), compared to the prediction from herwig tt signal events (with generated 
top quark mass of 172.5 GeV/c^) and simulated background events. 
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FIG. 29: The pT distribution of the reconstructed W bosons for 73 signal candidate events (2- 
tag and l-tag(T) subsamples), compared to the prediction from herwig tt signal events (with 
generated top quark mass of 172.5 GeV/c^) and simulated background events. 
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FIG. 30: The pT distribution of the reconstructed ti system for 73 signal candidate events (2- 
tag and l-tag(T) subsamples), compared to the prediction from herwig tt signal events (with 
generated top quark mass of 172.5 GeV/c^) and simulated background events. 
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FIG. 31: The number of jets distribution for 73 signal candidate events (2-tag and l-tag(T) sub- 
samples) , compared to the prediction from HERWIG ti signal events (with generated top quark mass 
of 172.5 GeV/c^) and simulated background events. Jets are required to have Et > 8 GeV and 
\tj\ < 2.0. 

VII. SYSTEMATIC UNCERTAINTIES 

Systematic uncertainties arise from uncertainties in our understanding of the detector 
response, and in the assumptions employed to infer a top quark mass from the observed 
data. The magnitudes of such uncertainties are estimated using auxiliary data collected for 
this purpose, and large samples of Monte Carlo simulated events that allow us to estimate 
the sensitivity of the measurements to reasonable variations in analysis assumptions. 

For each source of systematic uncertainty, the relevant quantities or parameters are varied 
by ±1(7, and new 178 GeV/c^ ti signal and background Monte Carlo templates are produced 
by performing event selection and mass reconstruction on the modified samples. Events for 
pseudo-experiments (see Section IV C|) are taken from these new templates, but the signal 
and background p.d.f.'s used in the analysis remain unchanged. The shift in the median 
fitted top quark mass for a large ensemble of pseudo-experiments is taken as the systematic 
uncertainty associated with a given assumption or effect. When the uncertainty on a given 
systematic shift due to the statistics of the Monte Carlo sample is larger than the shift itself, 
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that statistical uncertainty is used for the systematic uncertainty. 

A. Systematic Uncertainties Arising from the JES Calibration 

The use of the observed W boson mass to constrain the jet energy scale calibration 
essentially measures the average energy response of light quark jets arising from the decay 
of the colorless W boson. However, the top quark mass also depends on the energy response 
to b quark jets. This introduces three possible sources of uncertainty: i) uncertainties 
in energy response arising from uncertainties in the decay properties of bottom quarks, 
ii) uncertainties arising from the imperfect knowledge of the fragmentation properties of 
bottom quarks, and iii) uncertainties in energy response arising from the different color flow 
associated with bottom quark jets produced in top quark decay. 

We varied the B meson semi-leptonic branching ratios by about 10% of their values, cor- 
responding to their measurement uncertainties j2S|, in our Monte Carlo models to estimate 
the size of this uncertainty in the overall energy scale of the bottom quark jet. We found 
that this introduced an additional uncertainty in the bottom quark jet energy scale of 0.4%, 
resulting in an uncertainty in the extraction of the top quark mass of 0.4 GeV/c^. We used 
the high-statistics measurements of bottom quark fragmentation observed in Z ^ bb decays 
at the LEP and SLC colliders to constrain the fragmentation models in our Monte Carlo 
calculations. We found that this variation introduced an additional top quark mass uncer- 
tainty of 0.4 GeV/c^. In order to test the effects of possible variations in energy response 
due to different models of "color flow" in the top quark production and decay, we varied the 
parameters of the algorithms used to generate this color flow in both herwig and pythia 
and conservatively estimated that this could result in an uncertainty in the bottom quark 
jet energy scale of 0.3%. This results in an additional uncertainty in the top quark mass of 
0.3 GeV/c\ 

We add these three contributions in quadrature and include an additional 0.6 GeV/c^ 
systematic uncertainty in the top quark mass arising from the modeling of the bottom quark 
jets. 

In this analysis, the jet energy scale is assumed to have the same value for all jets in all 
events. However, the jet energy systematics have contributions from many sources, and those 
component uncertainties in general have different dependence on, for example, jet pt and t], 
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or on the event environment. To estimate the uncertainty arising from the assumption of a 
monohthic jet energy scale, we produce samples in which the components of the jet energy 
systematics are shifted independently and in various combinations. The typical shift in the 
top quark mass measurement is 0.5 GeV/c^, which is taken as the largest part of a "method" 
systematic. This systematic includes the offset of 0.07 cr observed in the pull distributions 
of Fig. ^[ which translates to 0.3 GeV/c^. 

Finally, the effect of the jet energy scale uncertainty on background events must be treated 
separately, since it is not included in the background template parameterization. We find 
a small uncertainty on Mtop, 0.04 GeV/c^, which we add linearly to the uncertainty due to 
the overall jet energy scale since the effects are correlated. 



B. ISR/FSR/PDF Systematic Errors 

The systematic uncertainties due to initial state radiation, final state radiation, and 
parton density functions are summarized in this section. 

Extra jets originating from the incoming partons and outgoing partons affect the mea- 
surement of Mtop when they are misidentified as jets from the final state partons or change 
the kinematics of the final state partons. ISR and FSR are controlled by the same DGLAP 
evolution equation that tells us the probability for a parton to branch 30, 31, 32, 33, Is^ . 
ISR is studied using Drell-Yan events in dilepton channels. The advantage of Drell-Yan 
events is that there is no FSR, and they are produced by the qq annihilation process, as are 
most (~ 85%) ti pairs. 

The level of ISR is measured as a function of the Drell-Yan mass scale and shows a loga- 
rithmic dependence on the Drell-Yan mass squared, as shown in Fig. EH By extrapolation, 
the ISR effect is then estimated at top pair production energies. Based on this measure- 
ment, two ISR systematic Monte Carlo samples {+1<7isr and —Ictisr) are produced using 
PYTHIA, by varying the value of Aqcd and scale factor, K to the transverse momentum 
scale for ISR showering. The parameters used are Aqcd{5 flavors) = 292 MeV, K = 0.5 for 
+laisR and Aqcd{5 flavors) = 73 MeV, K = 2.0 for —lajsR- The corresponding curves of 
Drell-Yan dilepton (px) vs invariant mass squared are shown in Fig. IHSl Although ISR is 
also sensitive to the choice of parton distribution function (PDF), the PDF uncertainty is 
not included as a part of the ISR uncertainty. Because a PDF change affects not only ISR 
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FIG. 32: The average pT of the dilepton system, which corresponds to the level of ISR activity, 
shows a logarithmic dependence on the dilepton invariant mass M^. The data are compared with 
the predictions of pythia 6.2 and of the +laisR and —lajsR samples. 

but also hard scattering kinematics, the PDF uncertainty is treated separately. The largest 
top quark mass shift between default pythia and the two ISR samples, 0.4 GeV/c^, is taken 
as the ISR uncertainty. 

Since ISR and FSR shower algorithms are the same, the same variations in Aqcd and 
K are used to generate FSR systematic samples by varying a set of parameters specific to 
FSR modeling. The largest top quark mass shift between default pythia and the two FSR 
samples, 0.6 GeV/c^, is used as the FSR uncerta inty . We examine the effects of higher 
order corrections to ti production using MC@NLO j35|, a full NLO Monte Carlo. Based on 
distributions of the number of jets and the ti pr, we find that NLO effects are covered by 
the ISR/FSR systematics. 

The calculation of the top quark invariant mass does not depend directly on the choice 
of input PDF. However, changing the PDF changes the top quark rj and pt distributions 
as well as the size of ISR. This results in a change in the jet pt distributions and in the 
probability of selecting the correct jets, both of which affect the reconstructed top quark 
mass. 
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To examine the systematic effect due to PDF uncertainties, 20 pairs of uncertainty sets 



based on CTEQ6M are used 
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These PDFs provide "±lcr" variations for 20 in- 
dependent eigenvectors, but do not include variation in Aqcd- In addition, the MRST 
group 3^ provides PDFs with different assumptions for the vahie of Aqcd- The difference 
between the measured top quark mass using the MRST72 {Aqcd =228 MeV) and MRST75 
{Aqcd =300 MeV) PDFs is taken as an uncertainty, as is the difference between leading 
order PDFs CTEQ5L and MRST72. Instead of 43 different, fully simulated sets of events, 
a single simulated sample is used, and mass templates are generated for the different PDF 
sets by weighting events according to the probability of observing their incoming partons 
using each PDF set. This technique also removes most of the uncertainty due to limited 
Monte Carlo statistics. A symmetrized uncertainty for each of the 20 pairs of CTEQ6 PDFs 
(determined by varying one eigenvector at a time) is added in quadrature to get one part 
of the PDF uncerainty, 0.20 GeV/c^. An additional systematic error of 0.22 GeV/c^ comes 
from the variation of Aqcd- This is consistent with the much less precise estimate using fully 
simulated samples. Adding a negligible contribution from the CTEQ-MRST difference, the 
total PDF uncertainty comes to 0.3 GeV/c^. 

In order to check the sensitivity of the top quark mass measurement to a very different 
top quark px distribution due to a new physics process, we have used a signal Monte Carlo 
sample with resonant tt production, where the resonance occurs at 700 GeV/c^ and then 
top quarks decay according to the standard model. The measured top quark mass is shifted 
by only 1.5 GeV/c^, demonstrating that this kinematic top mass fitter is nearly insensitive 
to the pt of the top quark. 



C. Other Systematic Errors 

The remaining sources of systematic uncertainty are described in this section. 

The difference in the top quark mass between HERWIG and pythia samples is 0.2 ± 
0.2 GeV/c^. To be conservative, this difference is taken as another systematic uncertainty, 
although the differences in ISR and FSR between the two generators are already taken into 
account in the ISR and FSR uncertainties, and fragmentation effects are accounted for in 
the jet energy uncertainties. 

The largest uncertainty in the shape of the reconstructed mass templates for background 
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events is due to the uncertainty in the scale that is used for the calculation of the hard 
scattering and for the shower evolution. Different background shapes are obtained for four 
different scales (4M^, M^, and + P|vf) using alpgen MC samples. An 

ALPGEN W\k + 2 parton Monte Carlo sample is used for the tagged events and a W + 4 
parton sample for the 0-tag and mistagged events. Half of the largest difference in top quark 
mass from pseudo-experiments using these samples is used as the systematic uncertainty, 
0.4 GeV/c^. Smaller contributions to the background shape uncertainty are estimated by 
performing sets of pseudo-experiments in which background events are drawn not from the 
combined background template but from templates for one of the individual background 
processes, or from the templates derived from QCD-enriched data. Half of the largest 
difference observed in these pseudo-experiments is 0.3 GeV/c^ for the different background 
processes, and 0.1 GeV/c^ for the different models of the QCD background. Both of these 
are taken as additional systematic uncertainties on the top quark mass due to background 
shape modeling. 

Different 6-tagging efficiency in data and simulation can introduce a bias in the top quark 
mass measurement. The i?;^* dependence of the b tagging in data and simulation agree very 
well. But if a slope on the E^r^^ dependence (consistent at 1 a with the measurement) is 
introduced in the tagging efficiency, the shift in the top quark mass is 0.1 GeV/c^, which is 
taken systematic uncertainty. 

The analysis can have a systematic bias due to the finite statistics of Monte Carlo sam- 
ples that are used to obtain the signal and background shape parameterizations. For a 
rough estimate of this uncertainty, sets of pseudo-experiments are performed with a series 
of fluctuated signal and background templates; in each fluctuated template, each bin is 
varied randomly according to Poisson statistics. For each fluctuated template, the median 
top quark mass measured by pseudo-experiments is shifted. The typical shift due to these 
statistical fluctuations, taken as a systematic uncertainty due to Monte Carlo statistics, is 
0.3 GeV/c2. 

D. Jet Systematic Errors 

The systematics on jet energy measurements are described in detail in Section QllBI The 
primary analysis fits for the jet energy scale, and the error from the likelihood fit includes 
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TABLE VIII: The uncertainties on the Mtop-only top quark mass measurement are shown for each 
jet energy systematic error. Estimates are obtained for the independent subsamples as weh as for 
the combined measurement. 


Jet energy systematic 

AMtop (GeV/c2) 
2-tag l-tag(T) l-tag(L) 0-tag 


Combined 


Response relative to central 
0.6 0.6 0.6 1.0 


0.6 


Modeling hadron jets (absolute scale) 
2.0 2.3 2.0 2.7 


2.2 


Modeling out-of-cone energy and underlying event 
2.2 2.2 1.9 1.9 


2.1 


Total systematic due to jet energies 
3.0 3.2 2.8 3.4 


3.1 


a contribution due to these systematics. For the Mtop 


-only fits, however, this systematic 



uncertainty must be estimated independently. 

To determine the systematic uncertainty on the top quark mass measurement given the 
various sources of uncertainty on the jet energy measurements, the mass shifts for +la 
and —la perturbations in the jet energies are extracted, and a symmetric uncertainty for 
each source is defined as half the difference between the two shifts. Table I VI III lists the 
uncertainties obtained for the Mtop-only measurement. The total systematic uncertainty 
in the top quark mass due to jet energy measurements is 3.1 GeV/c^ for the combined 
measurement. The corresponding systematic uncertainties for an independent measurement 
in each subsample are listed for comparison. 

E. Total Systematic Uncertainty 

The systematic uncertainties for the combined fit are listed in Table IIXI The total 
systematic uncertainty is estimated to be 1.3 GeV/c^, exclusive of the uncertainty due 
to jet energy scale that is included in the likelihood error. Also shown in Table IIXI are 
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TABLE IX: This table summarizes all systematic uncertainties for the combined analysis and two 
alternate fits. 



Method 


Primary 




Mtop-only 


Mtop-only 
+ JPB 




AMtop 


AJES 


AMtop 


AMtop 




(GeV/c2) 




(GeV/c2) 


(GeV/c2) 


Jet Energy 


N/A 


N/A 


3.1 


3.0 


6-jet Energy 


0.6 


0.25 


0.6 


0.6 


Method 


0.5 


0.02 


N/A 


N/A 


ISR 


0.4 


0.08 


0.4 


0.3 


FSR 


0.6 


0.06 


0.4 


0.6 


PDFs 


0.3 


0.04 


0.4 


0.4 


Generators 


0.2 


0.15 


0.3 


0.2 


Bkgd Shape 


0.5 


0.08 


0.5 


0.5 


b tagging 


0.1 


0.01 


0.2 


0.3 


MC stats 


0.3 


0.05 


0.4 


0.4 


Total 


1.3 


0.33 


3.3 


3.2 



the systematic uncertainties on the JES measurement (0.33 (7c total), and the systematic 
uncertainties on Mtop for the Mtop-only measurement without (3.3 GeV/c^ total) and with 
(3.2 GeV/c^ total) JPB tags. 

VIII. CONCLUSION 

We have made a new measurement of the top quark mass, 

173.5 tli (stat. + JES) ± 1.3 (other syst.) GeV/c^ (VIII.l) 
= 173.5 tli GeV/c2, 

using a novel technique that utilizes the jet energy scale information provided by the hadron- 
ically decaying W boson in the top quark events. This new top quark mass measurement 
provides the most precise single measurement on this important physical parameter. We 
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have performed a cross-check of this result using a more traditional fit that does not use the 
in situ jet energy scale information, and found excellent agreement in the central value of 
the top quark mass: 173.2 +2I (stat.) ±3.3 (syst.) GeV/c^. Finally, by adding an algorithm 
to increase the number of tagged 6 jets, we measure: 173.0 tl'^ (stat.) ±3.2 (syst.) GeV/c^. 

This measurement is part of a rich top physics program at CDF. As the luminosity 
acquired increases from the current 318 pb~^ to an expected 4000-7000 pb~^ for Run II, 
the statistical uncertainty on the top quark mass will improve. Using our technique, the 
dominant systematic uncertainty on the measurement, associated with the jet energy scale, 
will also be reduced with more data. As we approach total uncertainties of approximately 
2.0 GeV/c^, the uncertainties due to initial and final state radiation, as well as the bot- 
tom quark jet energy scale, become comparable to the statistical uncertainties associated 
with the top quark mass and jet energy scale measurement. We expect that these other 
systematics can also be improved with more work and more data in the relevant control 
samples. Additional top quark mass results from CDF are expected in the near future. We 
expect that these will continue to provide important inputs into our understanding of the 
fundamental fermions and the nature of the electroweak interaction. 

Acknowledgments 

We thank the Fermilab staff and the technical staffs of the participating institutions for 
their vital contributions. This work was supported by the U.S. Department of Energy and 
National Science Foundation; the Italian Istituto Nazionale di Fisica Nucleare; the Min- 
istry of Education, Culture, Sports, Science and Technology of Japan; the Natural Sciences 
and Engineering Research Council of Canada; the National Science Council of the Republic 
of China; the Swiss National Science Foundation; the A. P. Sloan Foundation; the Bun- 
desministerium fiir Bildung und Forschung, Germany; the Korean Science and Engineering 
Foundation and the Korean Research Foundation; the Particle Physics and Astronomy Re- 
search Council and the Royal Society, UK; the Russian Foundation for Basic Research; 
the Comision Interministerial de Ciencia y Tecnologia, Spain; in part by the European 
Community's Human Potential Programme under contract HPRN-CT-2002-00292; and the 



75 



Academy of Finland. 



[1] F. Abe et al. (CDF Collaboration), Phys. Rev. Lett. 74, 2626 (1995). 

[2] S. Abachi et al. (DO Collaboration), Phys. Rev. Lett. 74, 2632 (1995), hep-ex/9503003. 

[3] P. Azzi et al. (CDF Collaboration) (2004), liep-ex/0404010. 

[4] F. Abe et al. (CDF Collaboration), Phys. Rev. D50, 2966 (1994). 

[5] T. Affolder et al. (CDF Collaboration), Phys. Rev. D63, 032003 (2001). 

[6] D. Acosta et al. (CDF Collaboration), Phys. Rev. D71, 032001 (2005). 

[7] A. Sill (CDF Collaboration), Nucl. Instrum. Meth. A447, 1 (2000). 

[8] T. Affolder et al. (CDF Collaboration), Nucl. Instrum. Meth. A526, 249 (2004). 

[9] L. Balka et al. (CDF Collaboration), Nucl. Instrum. Meth. A267, 272 (1988). 
[10] S. Bcrtolucci et al. (CDF Collaboration), Nucl. Instrum. Meth. A267, 301 (1988). 
[11] M. G. Albrow et al. (CDF Collaboration), Nucl. Instrum. Meth. A480, 524 (2002). 
[12] G. AscoH et al, Nucl. Instrum. Meth. A268, 33 (1988). 
[13] E. J. Thomson et al, IEEE Trans. Nucl. Sci. 49, 1063 (2002). 

[14] G. Corcella, I. G. Knowles, G. Marchesini, S. Moretti, K. Odagiri, P. Richardson, M. H. 

Seymour, and B. R. Webber, J. High Energy Phys. 01, 010 (2001). 
[15] T. Sjostrand, L. Lonnblad, and S. Mrenna, TP 01-21, LU (2001), hep-ph/0108264. 
[16] M. L. Mangano, M. Moretti, F. Piccinini, R. Pittau, and A. D. Polosa, JHEP 07, 001 (2003). 
[17] E. A. Gerchtein and M. Paulini, ECONF C0303241, TUMT005 (2003), physics/0306031. 
[18] R. Brun, F. Bruyant, M. Maire, A. C. McPherson, and P. Zanarini (1987), CERN-DD/EE/84- 

1. 

[19] G. Grindhammer, M. Rudowicz, and S. Peters, Nucl. Instrum. Meth. A290, 469 (1990). 

[20] A. Abulencia et al. (CDF Collaboration) (2005), submitted to Phys. Rev. D, hep-ex/0508029. 

[21] F. Abe et al. (CDF Collaboration), Phys. Rev. D45, 1448 (1992). 

[22] D. Acosta et al. (CDF Collaboration), Phys. Rev. D71, 052003 (2005). 

[23] F. Abe et al. (CDF Collaboration), Phys. Rev. D50, 2966 (1994). 

[24] D. Buskulic et al. (ALEPH Collaboration), Phys. Lett. B313, 535 (1993). 

[25] J. M. Campbell and R. K. Ellis, Phys. Rev. D60, 113006 (1999). 

[26] B. W. Harris, E. Laenen, L. Phaf, Z. Sullivan, and S. Weinzierl, Phys. Rev. D66, 054024 

76 



(2002). 

[27] A. Bhatti et al, submitted to Nucl. Instrum. Methods A, hep-ex/0510047. 

[28] S. Eidelman et al. (Particle Data Group Collaboration), Phys. Lett. B592, 1 (2004). 

[29] S. M. Oliveira, L. Brucher, R. Santos, and A. Barroso, Phys. Rev. D64, 017301 (2001). 

[30] Y. L. Dokshitzer, Sov. Phys. JETP 46, 641 (1977). 

[31] V. N. Gribov and L. N. Lipatov, Yad. Fiz. 15, 1218 (1972). 

[32] V. N. Gribov and L. N. Lipatov, Yad. Fiz. 15, 781 (1972). 

[33] L. N. Lipatov, Sov. J. Nucl. Phys. 20, 94 (1975). 

[34] G. Altarelh and G. Parisi, Nucl. Phys. B126, 298 (1977). 

[35] S. Frixione, P. Nason, and B. Webber, JHEP 08, 007 (2003). 

[36] J. Pumplin et al, JHEP 07, 012 (2002). 

[37] D. Stump et al, JHEP 10, 046 (2003). 

[38] A. D. Martin, R. G. Roberts, W. J. Stirling, and R. S. Thorne, Eur. Phys. J. C14, 133 (2000). 



77 



